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Foreword 



My first touch with Code Division Multiple Access (CDMA) was during 
my early days at the European Space Agency (ESA) when I was involved 
with the development of an accurate geostationary satellite tracking system 
exploiting Direct Sequence CDMA. I distinctly recall the surprise to hear 
from my supervisor that “the spread spectrum technique allows transmitting 
signals below the thermal noise floor”. The statement was intriguing enough 
for me to enthusiastically accept working on the subject. 1 immediately fell 
in love with CDMA systems, as they soon revealed (both to my dismal and 
to my pleasure) being complex enough to keep me busy for more than a dec- 
ade. 

Shortly after moving to the ESA’s main R&D establishment in the Neth- 
erlands, I started to regard CDMA as a potential candidate for satellite fixed 
and mobile communication networks. It was a pioneering and exciting time, 
when CDMA was still confined to military, professional and navigation ap- 
plications. At ESA we developed preliminary architectures of CDMA sys- 
tems featuring band limited signals, and free of self noise interference trough 
a simple yet efficient approach based on tight code epoch synchronization. 
Concurrently, we also started the earliest CDMA digital satellite modems 
development. The laboratory experiments unveiling the ups and downs of 
(quasi-)orthogonal CDMA interference where shortly after followed by the 
first satellite tests. 

At that time a small US-based company named Qualcomm was moving 
the first steps in making CDMA technology for terrestrial cellular telephony 
truly commercial. And the fact that the co-founders of this small company 
were Dr. A.J. Viterbi and Dr. I.J. Jacobs convinced the management of ESA 
to financially support our modest R&D effort. While the ‘religious’ battle 
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between the TDMA and CDMA terrestrial armies was taking momentum, in 
our little corner we went on studying, understanding, experimenting, and 
improving on CDMA technologies. 

I had then the pleasure to closely follow the development of ESA’s first 
mobile and fixed CDMA satellite networks while witnessing the commercial 
deployment of the first terrestrial CDMA networks (IS-95), and directly par- 
ticipating to the early tests with the Globalstar satellite mobile telephony sys- 
tem during my stay at Qualcomm in ‘96-‘97. Since then CDMA technology 
started becoming the subject of industry courses, University lectures, and 
was often appearing on the front page of non-technical newspapers and 
magazines. 

The final battle corresponded to the selection of CDMA in several flavors 
as the air interface for the 3 rd Generation (3G) of personal communication 
systems: Universal Mobile Telecommunication Systems (UMTS) in Europe 
and Japan, and cdma2000 in the Americas. During the early days I also con- 
vinced my friend and former ESA colleagues Marco Luise and Filippo 
Giannetti, shortly followed by Luca Fanucci, to join the excitement and the 
frustrations of the satellite CDMA camp, and this was maybe the initial seed 
that later bloomed into this book. 

While ‘classical’ CDMA technologies where getting commercially de- 
ployed, a truly remarkable investigation effort was taking place in the aca- 
demic world about the issue of Multi User Detection (MUD) and Interfer- 
ence Mitigation (IM). MUD issues attracted the interest of hundreds of re- 
searchers around the world despite an initial skepticism about its effective- 
ness. With the authors of the book I was also ‘contaminated’ by the idea to 
develop more advanced CDMA detectors which can autonomously remove 
the CDMA self noise. But browsing hundreds of papers on the subject, we 
were still missing inspiration for some technique which can be readily im- 
plemented in the user terminal of a satellite network. 

Finally, in the mid nineties we get acquainted with the work by Honig, 
Madhow and Verdu, and so we get convinced that interference mitigation 
could be really done and could work fine in a wireless satellite network. This 
was the beginning of the endeavor described in this book, where a small 
group of people from Academia, with the due technical support from a big 
semiconductor firm, where able to put together possibly the first ASIC-based 
CDMA interference mitigating detector ever. But this is just the beginning of 
a new era which I am sure will be as exciting as the previous decade. 

Probably the most prominent Italian novelist, Alessandro Manzoni 
(1785-1873) used to modestly address his largely vast readership as “my 
twenty-five readers”. I am convinced that this book, too, will find (not the 
same!) twenty-five people that will enjoy and appreciate the spirit and les- 
sons learnt during this remarkable adventure, as if they were themselves part 
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of the team which carried out this exciting project financed by the ESA 
Technology Research Plan. 

Riccardo De Gaudenzi 

Head of the Communication Systems Section 
European Space Research and Technology Centre 
European Space Agency 

Noordwijk (The Netherlands), July 2003 
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Chapter 1 

INTRODUCING WIRELESS 
COMMUNICATIONS 



“Life will not be the same after the wireless revolution This is certainly 
true at the moment for countries in the Western world, and is going to be 
true in a few years for developing coun tries as well. So the aim of this Chap- 
ter is first to address the main terms of this revolution from the technical 
standpoint and to review the main second- and third-generation worldwide 
standards for wireless cellular communication, then to discuss how satellites 
can play a role in this scenario, and finally to show how this ‘revolution ’ 
could have taken place through the tremendous technological progress of 
(micro-) electronics. 



1. THE WIRELESS REVOLUTION 

In many European countries the number of wireless access connections 
between the user terminals (cellular phones, laptops, palmtops, etc.) and the 
fixed, high capacity transport network has already exceeded the number of 
wired connections. Untethered communications and computing has ulti- 
mately become part of a lifestyle, and the trend will undoubtedly go further 
in the near future, with the commercialization of low cost Wireless Local 
Area Networks (WLANs) for the home. Round the corner we may also en- 
visage pervasive, ad hoc wireless networks of sensors and user terminals 
communicating directly with each other via multiple hops, and without any 
need of support from the transport network. 

The picture we have just depicted is what we may call the wireless revo- 
lution [Rap91], Started in Europe in the early 90s, with the American coun- 
tries lagging by a few years, it will probably come to its full evolution within 
the end of the first decade of the third century, to rise again in a second great 
tidal wave when the Asian developing countries will catch up [Sas98], The 
real start of the revolution was the advent in Europe of the so called 2 nd Gen- 
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eration (2G), digital, pan-European cellular communication systems, the 
well known GSM (Global System for Mobile communications) [Pad95], The 
explosive growth of cellular communications had already started with earlier 
analog systems, the so called 1 st Generation (1G), but the real breakthrough 
was marked by the initially slow, then exponential, diffusion of GSM termi- 
nals, fostered by continent wide compatibility through international roaming. 
In the United States the advent of 2G digital systems was somewhat slowed 
down by the co-existence of incompatible systems and by the consequent 
lack of a nation wide accepted unique standard. The two competing 2G 
American standards are the so called ‘digital’ AMPS (Advanced Mobile 
Phone System) IS- 154 whose technology was developed with the specific 
aim of being compatible (as far as the assigned RF channels are concerned) 
with the pre-existing 1G analog AMPS system, and the highly innovative 
Code Division Multiple Access (CDMA) system 1S-95. 

In the second half of the 90s the GSM proved highly effective, boomed in 
Europe, and was adopted in many other countries across the whole world, 
including Australia, India, and most Asian countries. The initial European 
allocation of radio channels close to 900 MHz was paired by an additional 
allocation close to 1800 MHz (DCS- 1800 system) that led to the tripling of 
system capacity. GSM techniques were also ‘exported’ to the United States 
under the label of PCS (Personal Communication Systems) with an alloca- 
tion of channels close to 1900 MHz. At the turn of the century GSM, 
through its mature technology, started to be exploited as a true born digital 
system, delivering multimedia contents (paging, messaging, still images, and 
short videoclips). It is also being extended and augmented into a packet ac- 
cess radio network through the GPRS (Generalized Packet Radio Service) 
access mode (Figure 1-1), and will also be augmented to higher capacity 
through the EDGE (Enhanced Data rate for Global Evolution) technology. 
Both GPRS and EDGE are labeled ‘2.5G technologies’, since they represent 
the bridge towards 3 rd Generation (3G) systems which will be discussed 
later. 




Figure l-l . GSM/GPRS Network Architecture (http://www.gsm.org). 
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A similar evolution has taken place in the United States with CDMA 1S- 
95 2G systems (Figure 1-2) [Koh95], [Gil91]. After a controversial start in 
the area of California, CDMA systems were early adopted in Brazil, Russia, 
and Korea. They soon evolved into an articulated family of different systems 
and technologies called cdmaOne, all based on the original standard 1S-95 
and its evolutions. After ‘cellular CDMA’ at 800 MHz was launched its PCS 
version at 1900 MHz was soon made available. Packet access was embedded 
into the system, and a standard for fixed radio terminals to provide fixed 
wireless access to the transport network was also added. We shall not insist 
further on the evolution of 1G and 2-2. 5G systems in Japan, not to play 
again a well known song. 
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Figure 1-2. Network architecture of an IS-95 CDMA system (http://www.cdg.org). 

At the dawn of the third millennium the ITU (International Telecommu- 
nications Union), based in Geneva, took the initiative of promoting the de- 
velopment of a universal 3G mobile/personal wireless communication sys- 
tem with high capacity and a high degree of inter-operability among the dif- 
ferent network components, as depicted in Figure 1-3. Under the initiative 
IMT-2000 (International Mobile Telecommunications for the year 2000) 
[Chi92] a call for proposals was issued in 1997 to eventually set up the 
specifications and the technical recommendations for a universal system. At 
the end of the selection procedure, and in response to the different needs of 
the national industries, operators, and PTTs, two different non-compatible 
standards survived: UMTS (Universal Mobile Telecommunication System) 
for Europe and Japan, and cdma2000 for the USA. Both are based on a mix- 
ture of time and code division multiple access technologies. UMTS stems 
from a number of research projects carried out in the past by Europe and Ja- 
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pan (mainly FRAMES in Europe [Dah98] and CORE-A in Japan [Ada98]), 
whilst cdma2000, following a consolidated tradition in the standardization 
procedures in the United States, is a backwards compatible evolution of 2G 
CDMA [Kni98], 3G systems are being developed at the time of the writing 
of this Chapter (early 2003), with Japan leading the group. Some are ques- 
tioning the commercial validity of 3G systems (but this is something com- 
pletely outside the scope of this book), other say that 3G will not reveal such 
a breakthrough as 2G systems have admittedly been. We will say more on 
2G/3G systems in Section 1.2. 




Figure 1-3. IMT-2000 system concept (http://www.itu.org). 



The next wave of the wireless revolution may possibly come from 
WLANs [Nee99]. A WLAN is not just a replacement of a traditional wired 
LAN (such as the ubiquitous Ethernet in one of its 10/100/1000 Mbit/s ver- 
sions). Many forecasts envisage, in fact, a co-existence between wired cop- 
per LANs to link fixed PCs within an office (or a building) and wireless 
networks (the WLANs) yielding high bit rate together with a certain support 
of mobility and handovers. With WLANs laptops, palmtops, possibly port- 
able mp3 players and/or videoterminals, are all linked together, either via a 
central access point in a star topology (with immediate provision of connec- 
tivity with the fixed network), or directly with each other in an ‘ad hoc’, de- 
centralized architecture. The former architecture is typical of IEEE 802. 1 1 a- 
b networks that are at the moment gaining more and more popularity; the 
latter is the paradigm of Bluetooth pico-nets/scatter-nets and of IEEE 802.15 
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‘ad hoc’ communications. WLANs are becoming the standard untethered 
connection for nomadic computing, and may challenge UMTS when full 
mobility is not a fundamental requirement. Still to come are WLANs for the 
home (such as HomeRF and similar products currently being developed) 
which belong more to the field of consumer electronics than to telecommu- 
nications. The physical layer of most efficient WLANs (802.1 lb) is based on 
multi-carrier modulation, borrowed from the fields of TV terrestrial broad- 
casting (Digital Video Broadcasting — Terrestrial, DVB-T) and Digital Sub- 
scriber Line access techniques (xDSL). 

At the present time, people in the field of R&D for telecommunications 
speak of 4 th Generation (4G) systems. At the moment nobody actually 
knows what 4G is going to be. The main trend for the physical interface is to 
combine CDMA for efficient access and frequency re-use, and multi-carrier 
transmission (as in WLANs) in order to cope best with radio propagation 
channel impairments, into a Multi-Carrier CDMA (MC-CDMA) signal 
transmission format. 4G networks are also expected to yield maximum ca- 
pacity and flexibility. This means being able to integrate all of the scenarios 
mentioned above (traditional cellular systems for mobile communications, 
fixed wireless access systems, wireless LANs, ‘ad hoc’ network) into a sin- 
gle fully inter-operable, ubiquitous network. 

The successful implementation of all of the different wireless systems’ 
generations has relied, relies, and will rely on the formidable performance 
growth and cost/size decrease of Very Large Scale Integrated (VLSI) com- 
ponents. This fundamental enabling factor will be discussed in greater detail 
in Section 1 .4, and is the pivot of all of the work described in this book. 



2. 2G AND 3G WIRELESS COMMUNICATION SYS- 
TEMS IN EUROPE AND THE USA 

Both 2G and 3G systems are based on the concept of cellular communi- 
cations and channel frequency re-use [Kuc91], The concept of a cellular ra- 
dio network is well known: the service area to be covered by a provider is 
split into a number of cells (usually hexagonal, as in Figure 1-4). Each cell is 
served by a Radio Base Station (RBS) which manages a number of channels 
whose center frequencies lie within the radio frequency spectrum allocated 
to that provider. In doing so, and with some specific techniques to be pre- 
sented in a while, the same channels can be re-used in different cells, thus 
allowing them to serve a population of active users much larger than the 
mere number of allotted channels. 

The main difference between GSM and IS-95 (the two European and 
American 2G digital cellular systems, respectively) lies in the way channel 
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allocation within each cell is carried out. GSM is based on a mixture of Time 
ivision Multiple Access (TDMA) and Frequency Division Multiple Access 
(FDMA) to grant multiple access in the uplink, and on an equivalent 
TDM/FDM scheme for downlink multiplexing. Specifically, 8 TDMA chan- 
nels are allocated to a single carrier, and the different carrier are spaced 200 
kHz apart. Both with TDMA and with FDMA all channels can not be used in 
each cell, since that arrangement would give rise to excessive inter-cell in- 
terference. The latter comes from the possibility of co-existent active chan- 
nels on the same frequency and/or in the same time slot in adjacent cells. 
The solution of this issue is the technique of channel frequency re-use. As is 
shown in Figure 1-4, the cells on the coverage area are arranged into clusters 
(in the example, 7 cells/cluster). The total frequency band of the provider is 
then further split into chunks (as many chunks as cells in a cluster) of non- 
overlapping adjacent frequency channels (represented in Figure 1.4 by dif- 
ferent shades of gray). The chunks are then permanently allocated to the 
cells of a cluster in such a way that cells using the same chunk of channels 
are sufficiently spaced so that the level of inter-cell interference is harmless 
to the quality of the radio link. This apparently places a limit on the overall 
network capacity in terms of channels/cell, which directly translates into a 
limitation of the served users/unit area. 




Figure 1-4. Cellular radio network with frequency re-use (http://www.cdg.org). 

IS-95 is based on a different concept for multiplexing and multiple ac- 
cess. In the downlink the different channels, instead of different time slots or 
different carriers, are placed ‘onto’ different spreading codes, and they are 
transmitted at the same time and on the same carrier frequency. Such codes 
are taken from a set of orthogonal functions (the Walsh— Hadamard, or WH, 
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sequences) in such a way that each channel can always be extracted from the 
‘mixture’ of all of the downlink channels with no crosstalk from the others. 
This is the basic idea of Code Division Multiplexing (CDM) [DeG96] that 
will be discussed in greater detail in Chapter 2. The codes used for multi- 
plexing are binary digital waveforms whose clock is faster than the data 
clock, and so they cause a spectral spreading of the data signal: CDM is in- 
herently linked to Spread Spectrum (SS) modulation. The bandwidth in- 
crease factor is called the spreading factor of the spreading code, and is 
equal to 64 for IS-95. A similar concept is also used for the uplink, with a 
significant difference. The downlink WH sequences, also called channeliza- 
tion codes, stay orthogonal as long as they are synchronous. This is easily 
accomplished in the downlink, since the different tributary signals are physi- 
cally co-located in the RBS. In contrast, the uplink signals coming from the 
different mobile user terminals cannot be easily synchronized, and thus the 
spreading codes of each user cannot be made synchronous with any of the 
others. Therefore, when accessing the radio channel the CDMA signals are 
asynchronous, and this causes a residual crosstalk on each channel coming 
from all the others. This crosstalk, called Multiple Access Interference 
(MAI), can be made sufficiently small by increasing the spreading factor of 
the uplink channels. In IS-95 the gross channel bandwidth is 1.25 MHz, 
whilst the maximum data rate is 19.2 kbit/s. Of course, the MAI creates ei- 
ther an impairment on the quality of the link, or a limitation in capacity. It is 
clear, in fact, that the MAI is proportional to the number of active users in a 
cell. If the number is too large the level of MAI is too high, and incoming 
calls may be dropped, causing a capacity boundary. This is something that is 
not experienced by FDMA/TDMA systems, wherein intra-cell MAI is to- 
tally absent owing to the orthogonality of uplink signals. 

The real breakthrough of CDMA lies in the way channel re-use is han- 
dled. With CDMA, in fact, each cell, in addition to the channelization codes, 
is also given a unique scrambling code, so that each signal is ‘doubly en- 
coded’. The first level of coding (channelization) is necessary within the cell 
to make the different channels separable; the second level (scrambling) is 
necessary between adjacent cells to make signals arising from a different cell 
separable and not interfering; it is something similar to frequency re-use in 
FDMA/TDMA. But here there is actually no frequency re-use: thanks to the 
presence of the scrambling code, adjacent cells may use the same carrier fre- 
quency without creating an excessive level of inter-cell interference. Such an 
arrangement is represented in Figure 1-5, in which all cells are shaded the 
same way because all cells use the whole allocated bandwidth: it is the uni- 
versal frequency re-use. The channelization code allows the re-use all of the 
channelization codes in all of the cells, thus increasing overall capacity in 
terms of active users per square km with respect to FDMA/TDMA with fre- 
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quency re-use (Figure 1-4). Other features of CDMA (such as the use of 
channel codes to further increase capacity, the robustness to multi-path radio 
propagation, the possibility of performing seamless soft handover of com- 
munications between adjacent cells, and so on) have all contributed to make 
1S-95 and its evolutions a success. 




Figure 1-5. CDMA universal frequency re-use (http://www.cdg.org). 

But all 2G systems were substantially geared towards providing good- 
quality voice communications with limited data communication capabilities 
(just enough for paging and messaging services, and possibly for e-mail). 
Data channels were limited to a mere 9.6 or 14.4 kbit/s, which seemed satis- 
factory at the time of issuing the standards, but was revealed a few years 
later as patently inadequate for providing mobile Internet services and mul- 
timedia services in general. In the early 90s, therefore, different initiatives 
were taken in the United States, in Europe, and in Japan to develop an 
evolved 3G system with enhanced features: universal worldwide compatibil- 
ity and roaming, increased capacity (up to 2 Mbit/s for fixed wireless ser- 
vices and 384 kbit/s with full mobility), support for co-existing multi-rate 
connections (typical of multimedia applications), fast packet access with 
‘always on’ connections, and others. Such requirements led to the develop- 
ment of UMTS in Europe and Japan, and of cdma2000 in the Americas as 
mentioned in Section 1 . They are both based on different forms of wideband 
CDMA (W-CDMA) technologies, where ‘wide’ is intended to refer to 2G 
system. The nominal bandwidth of a UMTS carrier is, in fact, 5 MHz, as 
opposed to the 1.25 MHz of 1S-95. Both wideband CDMAs implement up- 
link/downlink full duplexing via Frequency Division Duplexing (FDD) and 
allocation of paired bands: for instance, European UMTS places the uplink 
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in the 60 MHz band 1920-1980 MHz, and the downlink in the paired 2110- 
2170 MHz band. UMTS also encompasses an additional (optional) hybrid 
TDMA/CDMA mode with Time Division Duplexing (TDD) on the narrower 
unpaired bands 1900-1920 and 2010-2025 MHz, for a total of 35 MHz. 

UMTS gained the headlines in newspapers worldwide around 2000 coin- 
ciding with the auctions of the frequency licenses, which took place in sev- 
eral European countries. During such auctions the cost of spectrum licenses 
in countries such as the United Kingdom and Germany reached unprece- 
dented levels for mobile operators. The large expectation built around 3G 
mobile wireless systems, of which UMTS represents the European interpre- 
tation, has not materialized yet, unfortunately. However, despite the delays 
in the UMTS commercial roll out and the scepticism affecting the telecom- 
munication world as a whole, UMTS will certainly play a key role in the de- 
velopment of multimedia wireless services. Although it is difficult to predict 
which kind of avenues 3G services will take, it is clear that the availability 
on the same device (i.e., the user terminal) of multimedia interactive services 
combined with accurate localization will open up astonishing possibilities 
for service providers. In addition to the current voice and short-message ser- 
vices, mobile users will be able to access the Internet at considerable peak 
speeds, download and upload documents, images, MP3 files, receive loca- 
tion dependent information, and so on. The mobile terminal functionalities 
will be greatly extended to make it a truly interactive Personal Digital Assis- 
tant (PDA) always connected to the Internet world and voice will just be- 
come one of the many multimedia services available at the user’s fingertips. 



3. THE ROLE OF SATELLITES IN 3G SYSTEMS 

UMTS was originally intended to be made of a terrestrial and a satellite 
component (denoted as T-UMTS and S-UMTS, respectively) integrated in a 
seamless way. The economic troubles experienced by 2G satellite Global 
Mobile Personal Communication Systems (GMPCSs) such as Iridium and 
Globalstar have largely mitigated the investor enthusiasm for satellite based 
ventures. Those GMPCSs systems are based on large Low Earth Orbiting 
(LEO) 48-64 satellite constellations aimed to provide GSM-like services 
from hand held terminals with worldwide coverage. However, despite the 
severe financial difficulties encountered by LEO GMPCS, new regional sys- 
tems based on geostationary (GEO) satellites, such as Thuraya and AceS, 
have been put in operation recently. Inmarsat, the first and most successful 
satellite mobile operator, intends to put into orbit new powerful GEO satel- 
lites providing UMTS-like services in the near future. 
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Since 1999 the European Space Agency (ESA) has carried out a number 
of system studies and technological developments to support the European 
and Canadian industry in the definition of S-UMTS component development 
strategy, identifying critical technological areas and promoting S-UMTS 
demonstrations [Bou02], [Cai99]. Particular effort has been devoted to the 
following key aspects: 

i) study and optimization of system architectures providing appealing fea- 
tures to possible operators; 

it) design, testing, and standardization within international bodies of an air 
interface with maximum commonality with terrestrial UMTS; 

Hi) development and validation of real time demonstrators for laboratory 
and over the air S-UMTS experiments; 

iv) design and development of large reflector antennas; 

v) design and prototyping of an advanced high throughput On Board 
Processor (OBP) for future mobile missions; 

vt) networking studies and simulations. 

Furthermore, ESA, in strict cooperation with the European Commission, 
has promoted the creation of the Advanced Mobile Satellite Task Force 
(ASMS-TF) which has now attracted more than 45 worldwide key players 
working together to define and defend the role of satellite in 3G mobile and 
beyond. 

The main new opportunities identified for S-UMTS and more in general 
for Advanced Mobile Systems by ESA studies and the ASMS-TF are: 

Development of Direct Mobile Multicasting & Broadcasting: address- 
ing mainly the consumer market, as well as specific corporate markets. 
This kind of system would overlay the terrestrial cellular networks by im- 
plementing ‘point to multi point’ services in a more efficient manner, and 
would complement terrestrial digital broadcast networks by extending and 
completing their geographical coverage. 

Extension of the wireless mobile terrestrial networks: i.e., coverage 
extension, coverage completion, global roaming, and rapid deployment. It 
is expected that in this market segment satellite mobile systems will re- 
quire a much higher degree of integration with terrestrial infrastructures 
than for extensions of existing mobile satellite systems. 

A possible S-UMTS system architecture encompassing both opportuni- 
ties listed above is presented in Figure 1-6. In this case the satellite provides 
‘Point to Point’ (P2P) unicast services for T-UMTS geographical comple- 
ment, as well as T-UMTS complementary broadcast and multicast services. 
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The latter will profit from the presence of a terrestrial gap filler network 
which will boost the Quality of Service (QoS) in densely populated areas 
where the satellite signal is often too weak to be received. T-UMTS cover- 
age extension may reveal an attractive solution for providing UMTS services 
to regions with low density population. The satellite user link is at S band (2 
GHz) as for T-UMTS, whilst the service link between the gateway stations 
and the satellite is at Ka band (20-30 GHz). The service links are mapped to 
the user beams by means of a bent pipe satellite transponder. For broadcast 
services, only the forward link (i.e., the gateway — > satellite — > user link) is 
implemented. For geographical T-UMTS extension a reverse link will be 
also implemented to provide a user — » satellite — » gateway connection. Ter- 
restrial gap fillers receive the broadcast satellite signal directly from the sat- 
ellite on a dedicated Ka band link and perform frequency downconversion 
on the same S-UMTS 2 GHz bands as above. 
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Figure 1-6. Satellite-UMTS Architecture. 

Terrestrial UMTS coverage extension calls for the generation of a large 
number of relatively small radio beams (compared to usual satellite beams, 
see Figure 1-7 as an example) generated by very large antenna reflectors 
(20-25 m). This allows the achievement of a considerable frequency re-use 
factor, similarly to what is done in a terrestrial cellular network (a cell is re- 
placed here by a beam). But, as stated above, S-UMTS may also comple- 
ment terrestrial UMTS in terms of services. This is why the concept of direct 
mobile digital multimedia multicasting and broadcasting is considered to be 
the best chance for satellite systems to access the mass mobile market. Ter- 








12 



Chapter 1 



restrial UMTS networks are based on very small cells (of a few km radius) 
which allow the provision of high peak rates (up to a few hundreds kbit/s) 
and attain a large frequency re-use. This is possible thanks to the CDMA 
technology which is at the core of the UMTS radio interface. However, the 
technical solution becomes very inefficient when the UMTS networks have 
to be used to transmit the same information to many users (e.g., video clips 
containing highlights of sport events, financial information, broadcasting of 
most accessed web pages, etc.). Broadcast information may be locally stored 
in the user terminal ( cacheing ) and accessed when required by the user. In 
this case a satellite broadcast layer with large cell sizes (Figure 1-8) on top 
of T-UMTS will provide a cheaper solution for this kind of services. Satellite 
systems have a major advantage in broadcasting information since a single 
satellite can cover regions as large as Europe or USA. Good quality of ser- 
vice can be achieved by means of powerful error correction techniques and 
by an integrated terrestrial gap filler network. 
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are re-using the same frequency (see Figure 1-9). In the case of terrestrial 
gap fillers other co-frequency transmitters also generate further co-channel 
interference. So the issue of co-channel interference mitigation to increase 
system capacity is pivotal to the economical development and deployment of 
(S-)UMTS. 




Figure 1-8. Sample broadcast footprint (courtesy of Alenia Spazio, Italy). 

sgm 

Figure 1-9. Interference pattern for a European multi-beam UMTS satellite (courtesy of 

Alenia Spazio, Italy). 
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4 . VLSI TECHNOLOGIES FOR WIRELESS COM- 
MUNICATION TERMINALS 

It is an everyday experience of life to buy the best and most expensive 
cellular phone at one’s retail store and after one year or so to find the same 
item at half that price. This is just the result of the celebrated Moore’s law: 
the number of transistor on a chip with a fixed area roughly doubles every 
year and a half. Hence the price of microelectronics components halves in 
the same period, or, the power of VLSI (Very Large-Scale Integrated) cir- 
cuits doubles over the same 1 8 months. 

As a matter of fact, over the last few decades Moore’s prediction has 
been remarkably prescient. The minimum sizes of the features of CMOS 
(Complementary Metal Oxide-Semiconductor) transistors have decreased on 
average by 13% per year from 3 pm in 1980 to 0.13 pm in 2002, die areas 
have increased by 13% per year, and design complexity (measured by the 
number of transistors on a chip) has increased at an annual growth rate of 
50% for Dynamic Random Access Memories (DRAMs) and 35% for micro- 
processors. Performance enhancements have been equally impressive. For 
example, clock frequencies for leading edge microprocessors have increased 
by more than 30% per year. An example related to transistor count of Intel 
microprocessors is reported in Figure 1-10. 



transistors 




Figure 1-10. Moore’s law and Intel microprocessors (courtesy of Intel). 

This enormous progress in semiconductor technology is fueling the 
growth in commercial wireless communications systems. New technologies 
are being spurred on by the desire to produce high performance, low power, 
small size, low cost, and high efficiency wireless terminals. The complexity 
of wireless communication systems is significantly increasing with the ap- 
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plication of more sophisticated multiple-access, digital modulation and proc- 
essing techniques in order to accommodate the tremendous growth in the 
number of subscribers, thus offering vastly increased functionality with bet- 
ter quality of service. In Figure 1-11 we have illustrated the processor per- 
formance (according to Moore’s Law) together with a qualitative indication 
of the algorithm complexity increase (approaching the theoretic performance 
limits imposed by Shannon’s theory), which leaps forward whenever a new 
wireless generation is introduced, as well as the available battery capacity, 
which unfortunately increases only marginally [RabOO]. 
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Figure 1-11. Moore’s law, system complexity and battery capacity [RabOO], 



It appears that system complexity grows faster than Moore’s Law, and so a 
‘brute force’ use of the available processing power (GIPS/s) in a fully pro- 
grammable implementation is not sufficient; often dedicated hardware accel- 
erators are required. Furthermore, taking the battery capacity limit into ac- 
count, the use of dedicated hardware becomes mandatory to reduce power 
consumption. This is the fundamental trade off between energy efficiency 
(i.e., the number of operations that can be performed for a given amount of 
energy) and flexibility (i.e., the possibility to re-use a single design for mul- 
tiple applications) which is clearly illustrated in Figure 1-12 for various im- 
plementation styles [RabOO]. An amazing three orders of magnitude vriation 
of energy efficiency (as measured in MOPS/mW) can be observed between 
an ASIC (Application Specific Integrated Circuit) style solution and a fully 
programmable implementation on an embedded processor. The differences 
are mostly owed to the overhead that comes with flexibility. Application 
specific processors and configurable solutions improve energy efficiency at 
the expense of flexibility. The most obvious way of combining flexibility 
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and cost efficiency is to take the best from different worlds: computationally 
intensive signal processing tasks are better implemented on DSP (Digital 
Signal Processor) cores or media processor cores than on a microprocessor 
core, whilst the opposite is true for control tasks. 

As shown in Figure 1-13, a typical wireless transceiver combines a data 
pipe, which gradually transforms the bit serial data stream coming from the 
Analog to Digital Converter (ADC ) into a set of complex data messages, and 
a protocol stack, that controls the operation of the data pipe. Data pipe and 
protocol stack differ in the kind of computation that is to be performed, and 
in the communication mechanisms between the functional modules. In addi- 
tion, the different modules of the data and control stacks operate on time and 
data granularities which vary over a wide range. The conclusion is that a 
heterogeneous architecture which optimally explores the ‘flexibility-power- 
performance-cosf design space is the only viable solution of handling the 
exponentially increasing algorithmic complexity (which is mainly owed to 
multiple standards, adaptability and increased functionality) and the battery 
power constraint in wireless terminals. Figure 1-14 shows a typical hetero- 
geneous System on a Chip (SoC) architecture employing several standard as 
well as application specific programmable processors, on chip memories, 
bus based architecture, dedicated hardware co-processor, peripherals, and 
Inpu/Output (I/O) channels. 
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Figure 1-12. Trading off flexibility versus energy efficiency (in MOPS/mw or million of op- 
eration per mj of energy) for different implementation styles [RabOO], 
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Figure 1-13. Functional components of a wireless transceiver. 




Figure 1-14. Typical heterogeneous System-on-Chip platfonn. 

Now that the main architecture of the terminal is decided, the subsequent 
key problem is how to map the system/algorithm onto the various building 
blocks of a heterogeneous, configurable SoC architecture (hardware and 
software) within given constraints of cost and time to market. 

An extensive profiling/analysis of the application/algorithm in the early 
algorithmic design phases can help to determine the required bounds on per- 
formance and flexibility, or to outline the dominant computational pattern 
and explore data transfer and storage communications. This step is both te- 
dious and error prone if carried out fully manually, and so new design meth- 
odologies have to be provided to bridge the gap between algorithmic devel- 
opment and cost effective realization. There is a need for fast guidance and 
early feedback at the algorithm level, without going all the way down to as- 
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sembly code or hardware layout (thus getting rid of long design cycles). 
Only when the design space has been sufficiently explored at a high level, 
and when a limited number of promising candidates have been identified, a 
more thorough and accurate evaluation is required for the final hard- 
ware/software partitioning. Most importantly, the optimum system is always 
the result of a joint, truly, interactive architecture-algorithm design. A better 
algorithm (even the best) from a communication performance standpoint 
may not correspond to a suitable computational/communication architecture. 
Since no single designer can adequately handle algorithms, design method- 
ologies and architectures, a close interaction between designers (the sys- 
tem/communication engineer and the VLSI/chip architect) and design teams 
is required to master such a complex SoC design space. 

Therefore, the designer’s efficiency must be improved by a new design 
methodology which benefits from the re-use of Intellectual Property (IP) and 
which is supported by appropriate tools that allow the joint design and veri- 
fication of heterogeneous hardware and software. Particularly, owing to the 
exponential increase of both design gate counts and verification vectors, the 
verification gap grows faster than the design size by a factor of 2/3 according 
to the International Technology Roadmap for Semiconductor (1TRS) road 
map. 

This is the well known design productivity challenge that has existed for 
a long time. Figure 1-15 shows how Integrated Circuits (ICs) complexity (in 
logic transistors) is growing faster than the productivity of a design engineer, 
creating a ‘design gap’. One way of addressing this gap is to steadily in- 
crease the size of the design teams working on a single project. We observe 
this trend in the high performance processor world, where teams of more 
than a few hundred people are no longer a surprise. This approach cannot be 
sustained in the long term, but fortunately, about once in a decade we wit- 
ness the introduction of a novel design methodology that creates a step func- 
tion in design productivity, helping to bridge the gap temporarily. Looking 
back over the past four decades, we can identify a certain number of 
productivity leaps. Pure custom design was the norm in the early integrated 
circuits of the 1970s. Since then programmable logic arrays, standard cell, 
macrocells, module compilers, gate arrays, and reconfigurable hardware 
have steadily helped to ease the time and cost of mapping a function onto 
silicon. Today semiconductor technology allows the integration of a wide 
range of complex functions on a single die, the SoC concept already men- 
tioned. This approach introduces some major challenges which have to be 
addressed for the technology to become a viable undertaking: i) very high 
cost of production facilities and mask making (in 0.13 pm chips, mask costs 
of $600,000 are not uncommon); it) increase performance predictability re- 
ducing the risk involved in complex SoC design and manufacturing as a re- 
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suit of deep sub-micron (0.13 pm and below) second order effects (such as 
crosstalk, electro-migration and wire delays which can be more important 
than gate delays). These observations, combined with an intense pressure to 
reduce the time to market, requires a design paradigm shift comparable with 
the advent of the driving of ASIC design by cell libraries in the early 1980s, 
to move to the next design productivity level by further raising the level of 
abstraction. To this aim, recently the use of platforms at all of the key articu- 
lation points in the SoC design has been advocated [Fer99], Each platform 
represents a layer in the design flow for which the underlying subsequent 
design flow steps are abstracted. By carefully defining the platforms’ layers 
and developing new representations and associated transitions from one plat- 
form to the next, an economically feasible SoC design flow can be realized. 
Platform based design provides a rigorous foundation for design re-use, ‘cor- 
rect by construction’ assembly of pre-designed and pre-characterized com- 
ponents (versus full custom design methods), design flexibility (through an 
extended use of reconfigurable and programmable modules) and efficient 
compilation from specification to implementations. At the same time it al- 
lows us to trade off various components of manufacturing, Non-Recurrent 
Engineering (NRE) and design costs while sacrificing as little potential de- 
sign performance as possible. A number of companies have already em- 
braced the platform concept in the design of integrated embedded systems. 
Examples are the Nexperia platform by Philips Semiconductor [ClaOO], 
[Gra02], the Gold platform by Infineon [HauOl], and the Ericsson Mobile 
Platform by Ericsson [Mat02]. 
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Figure 1-15. The design productivity gap, showing the different Compound Annual Growth 
Rates (CAGRs) of technology (in logic transistors per chip) and design productivity (in tran- 
sistors designed by a single design engineer per month) over the past two decades. 
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Chapter 2 

BASICS OF CDMA FOR WIRELESS 
COMMUNICATIONS 



Is the reader familiar with the basic concepts in CDMA communications? 
Then he/she can safely skip the three initial Sections of this Chapter. If 
he/she is not, he/she will find there the main issues in generation and detec- 
tion of a CDMA signal, and the basic architecture of a DSP-based CDMA 
receiver. But even the more experienced reader will benefit from the subse- 
quent three sections of this Chapter, which deal with the use of CDMA in a 
satellite mobile network (with typical numerical values of the main system 
parameters), with the relevant techniques for interference mitigation (can- 
cellation), and with the specifications of the case considered in the book and 
referred to as MUSIC (Midti USer and Interference Cancellation). 



1. NARROWBAND AND WIDEBAND DIGITAL 
MODULATIONS 

The generic expression of a linear band pass modulated signal s{t) is 

s(t) = Sj (t) ■ cos(2nf 0 t) - s Q (t)-sin(27i/ 0 i) , (2.1) 

where f 0 is the carrier frequency, whilst s, ( t ) and s Q ( t ) are two baseband 
signals which represent the In phase (I) and the Quadrature (Q) components 
of the modulated signal, respectively. A more compact representation of the 
I/Q modulated signal (2.1), provided that the carrier frequency f 0 is known, 
is its complex envelope (or baseband equivalent ) defined as follows 
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s(0 =5 /(0 + -M0- ( 2 - 2 ) 

The relation between the complex-valued baseband equivalent and the 
real-valued band pass modulated signal is straightforward 

5(t) = 9f{s(t)-e j2,t/ “'}. (2.3) 

The normalized power of the baseband components s, (/) and s Q ( t ) are 

^=E{ S ;(<)},^=E{4(()j, (2.4) 

so that from (2.2) the power of the complex envelope s(t) is 

=E {|^(0| 2 } =P v +p s e - (2-5) 

From (2.1) we also find that the Radio Frequency (RF) power of the 
modulated signal s(t) is given by 

^= E K(0}= : ^^ = f • (2-6) 

Assume now that the information data source is generating a stream of 
information bearing binary symbols (hits) {u m } running at a rate R b =l/T b , 
where T b is the bit interval. The information bits are mapped onto a 
constellation of W symbols (represented as a set of points in the complex 
plane) and each complex symbol is then labeled by a ‘word’ of log 2 (W) 
bits. This mapping generates a stream of complex- valued symbols {d k } with 

d k = d l,k Jr ]d-Q k (2-7) 

running at the rate R s = 1 /T s , where T. is the symbol, or signaling, interval, 
R =R b l log, (W) . The symbols are subsequently shaped by a filter with a 
T s energy impulse response g, (t ) so as to obtain the following (baseband 
equivalent) data modulated signal 

1 2 P 00 _ 

S (*) = \h*" £ d k-g T {t-kT s ). 

V A d 



( 2 . 8 ) 
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The equation above is the general expression of a linear digital modula- 
tion where P s is the RF power of the modulated signal (2.1), as defined in 
(2.6), and Aj is the mean squared value of the data symbols 

^=E{|rf t | 2 } = 4+4 (2.9) 

with 

4 =£{<*}> <=e{44- ( 2 - 10 ) 

The block diagram of the linear I/Q modulator described in (2.8), is 
shown in Figure 2-1, where we have introduced the amplitude coefficient 

a = 4 2P J A S ( 2- 1 !) 

and its relevant baseband complex equivalent, as in (2.2), is shown in Figure 
2-2. The linear I/Q demodulator to recover the digital data from signal (2.3) 
is shown in Figure 2-3, where the (ideal) low pass filters H(f) detect the 
baseband components s, ( t ) and s Q ( t ) , and suppress double frequency com- 
ponents ( image spectra) arising from the previous mixing process. In all of 
the figures in this chapter and in the remaining part of the book, thick lines 
denote complex-valued signals. 




s(t) 



Figure 2-1. Block diagram of a linear I/Q modulator. 
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d k 





Figure 2-2. Baseband block diagram of a linear I/Q modulator. 




Figure 2-3. Block diagram of a linear I/Q demodulator (B is the signal bandwidth). 

Probably the most popular truly I/Q modulation format is Quadrature 
Amplitude Modulation (QAM) with a square lb-point constellation ( W- 
QAM). Assuming that W = w 2 , we have 

d Lk , d Q ' k e {±1, ± 3, . . . , ( yfW - 1)} = {±1, ± 3, . . . , ± (w - 1)} , (2.12) 

and therefore 



W - 1 
3 



(2.13) 



Whilst Ifi-QAM is widely used in wireline modems, satellite communica- 
tions more often rely on Phase Shift Keying (PSK) constellations. A (b'-PSK 
constellation is defined by 
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4= L (2-15) 

The two classes of PSK and QAM signals have a common element, since 
the two formats of 4-QAM and 4-PSK (or Quadrature Phase Shift keying, 
QPSK) are equivalent. 

It can be shown [Pro95] that the Power Spectral Density (PSD) of a base- 
band digital signal (expressed in [V 2 / Hz]) is 

( 2-10 

where G T (f ) is the frequency response of the pulse shaping filter (whose 
impulse response is g T (t) ), and where we have introduced the so called data 
spectrum 



oo 

k =- oo 



(2.17) 



as the Fourier transform of the (discrete time) data autocorrelation sequence 






(2.18) 



Assuming that the symbol rate R is much smaller than the carrier fre- 
quency f 0 , the PSD of a modulated signal such as (2.1) can be obtained 
from that of the relevant complex envelope, as follows 

<p s (/) = ^fe(/-/o) + ^(/ + /o)]- (2-19) 

The power of the baseband signal is then 

oo 

P,= J<P ff (/)d \f, (2.20) 

—oo 



whilst for the band pass signal we obtain from (2.19) 
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P. = ]<?.{f) d/ = f (2-21) 

—oo 

in agreement with (2.6). 

The most naive data pulse is the rectangular shaping, whereby we have 

( t \ 

g T (t) = rect — (2.22) 

\^s J 

and 

G T (f) = T s • sine (fT s ) (2.23) 

From the expression of the baseband PSD (2.16) we have that in the case 
of rectangular shaping the signal bandwidth measured at the first spectral 
null is given by 

B u,nuii = R s =VT S . (2.24) 



When bandwidth comes at a premium (as is always the case in wireless 
communications) some form of band limiting is in order, and so rectangular 
pulses are no longer used. Let us therefore introduce the Nyquist’s Raised 
Cosine (RC) pulse defined as follows 



§Nyq (0 



cos(jrat/T' s , ) 
l-(2a t/Tf 



•sine 




(2.25) 



whose spectrum is (see Figure 2-4) 




(2.26) 
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where a (with 0 < a < 1 ) is the roll off factor. The most popular band lim- 
ited transmission pulse shape is the ( T energy) Nyquist’s Square Root 
Raised Cosine (SRRC), given in the frequency domain by 

G T (f) = jf-G Nyq (f). (2.27) 

The bandwidth occupancy of a baseband digital signal with SRRC shap- 
ing is therefore (see Figure 2-4) 

*_=(! + «) A = ^ (2.28) 

and its -3 dB bandwidth is 

<2 - 29) 




Figure 2-4. Spectrum of the Nyquist’s SRRC pulse. 

In the case of Spread Spectrum (SS) systems the bandwidth occupancy of 
the transmitted signal is intentionally augmented well beyond the value re- 
quired for conventional narrowband transmissions, and this can be essen- 
tially accomplished in two ways. 

The first one, at least from an historical perspective, dates back to 1941 
and was conceived for providing secure military communications during 
WWII. The basic idea underlying this early spread spectrum concept consists 
in partitioning the overall spectrum into K separate channels or bands (also 
referred to as frequency bins), all having the same bandwidth as the signal to 
be transmitted. Spectrum spreading is achieved by transmitting the (conven- 
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tional) narrowband signal into one bin for a given time interval T h , and 
then by randomly changing the carrier frequency of the modulator so as to 
place the signal spectrum in another bin, and so forth. The final result is that 
the signal spectrum ‘hops’ from bin to bin with an apparently casual pattern 
to escape hostile jamming and/or eavesdropping by unauthorized listeners. 
Owing to such a feature, this technique has been named Frequency Hopping 
Spread Spectrum (FH/SS). Actually, the pattern of the frequency bins is not 
truly casual, but it is instead pseudo-random (i.e., apparently random) and 
repeats every K-T hop seconds. The (periodic) frequency pattern is made 
known also to the authorized receiver which is then capable of tracking the 
transmitted carrier. 

It turns out that the total bandwidth occupancy of this SS system is K 
times that of the original narrowband signal. It is also apparent that the ro- 
bustness of a FH/SS system against interference and unauthorized detection 
increases with the length K of the frequency hopping pattern, i.e., with the 
total occupied bandwidth. However, owing to strict requirements concerning 
oscillator frequency stability and switching rate, FH/SS does not find signifi- 
cant applications in multiple access commercial systems for mobile and cel- 
lular communications, and therefore it will not be considered any further in 
the following. The interested reader can find more details about the origins 
of SS in [Sch82] and about FH/SS transmssion in [Sim85], [Dix94], 

An alternative way of generating a SS signal consists of the direct multi- 
plication, in the time domain, of the information bearing symbols, running at 
the rate R s =1/7) , with a sequence of binary symbols {chips) running at a 
much higher rate R c =1/7) (with T c the chip interval), and with a repetition 
period L [Dix94], [Pic82], [Sim85]. This multiplication, which is carried out 
before transmit pulse shaping takes place, produces a stream of high rate 
symbols running at the rate R c - the resulting signal to be transmitted turns 
out to have a bandwidth occupancy wider than that of conventional modula- 
tion schemes. For this reason the sequence of high rate chips is also referred 
to as spreading sequence or spreading code, and this wideband transmission 
technique is called Direct Sequence Spread Spectrum (DS/SS). This is the 
most used spread spectrum signaling technique used in commercial commu- 
nication systems. In the following we will therefore restrict our attention to 
DS/SS transmission schemes. 

Let us focus now on the analytical description of a DS/SS signal format 
and on the relevant features [DeG96]. In the simplest arrangement the 
spreading sequence has real-valued chips c k selected at random in the binary 
alphabet {-1,1} (this is what is done, for instance, in the uplink of the 
American cellular CDMA systems IS-95 or cdmaOne). Thus the spreading 
code is random non-periodic. It may also be expedient to have a periodic 
pseudo-random spreading code with a repetition length (repetition period) 
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equal to L chips. When this short code arrangement is adopted (as for in- 
stance in the UMTS downlink), the repetition period of the spreading se- 
quence usually coincides with one symbol interval, i.e., f =LT C . The re- 
sulting DS/SS signal for the short code format is 

oo 

5 (ss) (t) = A-J^ d {k]i ■ c kV • g T (t - kT c ) , (2.30) 

k=- oo 

where we have introduced the following operators 

{k} L =intf-^J , \k\ L = kmo&L . (2.31) 

Figure 2-5 depicts the DS/SS transmitter described by (2.30), in which 
the amplitude coefficient A has the same meaning as in Figures 2-1 and 2-2. 
It can be shown, in fact, that the PSD of the DS/SS signal is given by the 
convolution between the narrowband PSD of the data and the wideband PSD 
of the spreading sequence, and that the power of the DS/SS signal is exactly 
the same as the power P s of the narrowband signal. 



c k 

Figure 2-5. Baseband block diagram of a DS/SS transmitter. 

It is apparent that the sections of the DS/SS transmitter in Figure 2-5 are 
the same as those in Figure 2-2, provided that the chip rate R c =1/7) is ex- 
changed with the symbol rate R s . In particular, the chip shaping filter in 
Figure 2-5 outputs a new T c energy pulse g T (t) every T c seconds. The in- 
dex k ticks at the rate R c , the chips’ subscript \k\ L repeats every L chip in- 
tervals, and the data symbols subscript {k} L ticks at the rate LR c = R as it 
should. In the case of a DS/SS transmission with rectangular shaping, the 
pulse (2.22) then becomes 

g r (t) = rect 






(2.32) 
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and the bandwidth occupancy (2.24) must be modified accordingly, 



B 



(SS) 

Is? null 




(2.33) 



Apart from some particular applications, mainly military and the Global 
Positioning System (GPS), the spectrum of the DS/SS signal will be strictly 
limited. This can be achieved by resorting to Nyquist’s SRRC shaping of the 
chip pulses. These pulses are described by (2.25)-(2.27) where the symbol 
duration 7) must be replaced with the chip interval T c . Similarly the band- 
widths (2.28) and (2.29) modify as follows 



B 



(ss) = 

max 




1 + a 



(2.34) 



B 



(ss) _ 

-3dB — 



K 

2 




(2.35) 



respectively. After spreading and chip pulse shaping, the baseband signal 
(2.30) is eventually upconverted to RF and transmitted using a pair of l/Q 
carriers as in Figure 2-1. 



2. PROPERTIES OF SPREAD SPECTRUM 
SIGNALS 

We generalize now the DS/SS signal format (2.30), by allowing in par- 
ticular that the code repetition length L be different from the so called 
spreading factor 

M = — = — (2.36) 

r R s 

We obtain thus the following expression [DeG96] 

oo 

s (ss) ( t ) = A-Z d [k)u ■ c K ■ g T (t - kf ) , 

k=- oo 



(2.37) 
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where, according to the definitions (2.31), the subscript {k} M of the data 
symbols is now updated every M ticks of the index k . This is the most 
general form of a DS/SS signal. 

We also remark that for any of the bandwidth definitions presented above 
we have 



M = 



B (ss) 

B 



(2.38) 



Therefore the parameter M can be seen as the ratio between the band- 
width occupancy of the SS signal B {SS) and the bandwidth B of the conven- 
tional narrowband signal. This is why we called M the spreading factor, that 
represents one of the main parameters characterizing the properties of a SS 
signal. Figure 2-6 shows the PSD of a code for DS/SS transmissions with 
spreading factor M = 8 compared with the PSD of a conventional narrow- 
band signal with rectangular pulse shaping. If one symbol interval spans ex- 
actly an integer number n of the spreading sequence repetition periods, 

T s = n-L-T c , (2.39) 



then the spreading sequence is termed short code and the spreading factor is 
M = n-L . Otherwise, if the repetition period of the spreading sequence is 
much longer than the symbol duration, L-T c ps>T s , then the sequence is 
called long code and the spreading factor is M . As a particular case of 
short code spreading we have the simplest configuration with n = 1 men- 
tioned in the previous section, where the code repetition period is exactly co- 
incident with one symbol interval, i.e., M - L . 

One other important parameter of SS signals is the processing gain G p , 
which is often confused with the spreading factor, and which actually has to 
do with the anti-jam capability of the receiver. The processing gain is de- 
fined as the ratio between the chip rate and the information bit rate 



G. 




(2.40) 



Recalling that R s =R b l log 2 (W) , it is easy to derive the relationship be- 
tween spreading factor and processing gain 



M = G p -log 2 (W). 



(2.41) 
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A widely used method for the generation of a binary spreading sequence 
{c k } is based on an m stage Linear Feedback Shift Register (LFSR) used as 
a binary sequence generator. The feedback taps of the circuit are properly set 
to yield a sequence with particular features which are desirable for SS 
transmissions [Dix94], [Pic82], [Sim85], [Din98]. According to the polyno- 
mial theory [Pet72], it is possible to select the feedback taps so as to obtain a 
sequence with the longest possible repetition period L = 2"' - 1 and good 
correlation properties [Sar80], the so called maximal length or m-sequences. 
Tables for the configurations of m-sequence generators are easily found in 
the literature [Dix94], [Pic82] . In an n? -sequence the number of logical ‘0’s 
and Ts is 2"'~ 1 -1 and 2 m ~ 1 , respectively [Dix94]. Since in all practical 
cases we have L » 1 the occurrence rate of each symbol is virtually the 
same within each period, so that such sequences are called balanced. A 
deeper analysis of the location of ‘0’s and ‘ 1 ’s within one period also reveals 
that the blocks of consecutive identical symbols (the so called runs) follow a 
given distribution that makes the appearance of the sequence similar to that 
of a truly random one [Dix94] . Owing to these properties a sequence of this 
kind is also named Pseudo-Random Binary Sequence (PRBS). 




Figure 2-6. PSD of a spreading code for DS/SS transmissions, M= 8. 

Concerning correlation properties, we define first the following periodic 
cross-correlation sequence between the two generic binary sequences {a k } 
and {b k }, a k ,b k e {-!,+!} 



1 i_1 

R «iA i )=j-'L a k A + ,, • 

F k=o 



(2.42) 
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K b =K„(0) (2-43) 

The (periodic) autocorrelation sequence of the sequence j c k } is also 
given by 



1 1-1 

R cc{ i ) = ~r'Tj c k ' C \k + L • 
B k= o 



(2.44) 



We can easily show that the autocorrelation sequence of an m -sequence is 



MO 



[i /= o, 

{-1/1 i* 0. 



(2.45) 



Such an autocorrelation function is similar to that of a delta-correlated 
white noise process. Therefore, owing to this ‘noise-like’ behavior, such se- 
quences are also referred to as Pseudo-Noise (PN). 

In the spreading arrangement (2.30) discussed above the spreading code 
symbols c k are real-valued, and therefore such an SS scheme is referred to 
as Real Spreading (RS). In particular, if the data d k are complex-valued the 
SS format in (2.30) is called Quadrature Real Spreading (Q-RS). 

This scheme is probably the most intuitive, but it is not the only one. Fur- 
ther spreading schemes involving different spreading codes for the I Q com- 
ponents are used in the practice. For instance, if we let (see Figure 2-7) 

oo , v 

s (ss) (t) = A.X [d I{k}u ■ c, Hl + j d Q[k}u ■ c Qj/c | J -g T (t- kT e ) (2.46) 

k=- oo 



we have two information bearing symbol streams {dj ^x } , {d Q ^ } and 
two distinct spreading sequences {c, | t } , {c Q k } . As a consequence the 
complex spreading scheme (2.46) can be seen as a combination of two RS 
schemes operating independently, the one for the I-, and the other for the Q- 
finger, respectively. This is called dual Real Spreading (d-RS) since it uses 
two different code sequences to independently spread two different data 
streams. By replicating the same data stream on both the I- and Q-finger of 
(2.46), we can derive a particular case of spreading,, i.e., 

d I.{k} M =d Q,{k} u =d {k} 



( 2 . 47 ) 
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with d {k y M real-valued, thus obtaining 

5 (ss> (r) = A ■ X d [k]u • (c, |Al + j c qK ) • g r (f - kT c ) . (2.48) 

k=- oo 




Figure 2-7. Baseband block diagram of a DS/SS transmitter 
employing two spreading sequences. 

The latter SS format is named Complex Spreading (CS) since it uses a 
complex-valued sequence to spread a stream of real-valued symbols. A per- 
formance comparison among the different spreading schemes outlined above 
will be presented in a subsequent Section. 

Now, let us focus our attention on the detection of a DS/SS signal. The 
simplified architecture of an I/Q receiver for a DS/SS signal is shown in Fig 
2-8. The received signal undergoes I/Q baseband conversion by means of a 
front end such as the one depicted in Figure 2-3. Assuming for simplicity a 
DS/SS like in (2.30), the signal after baseband conversion is (perfect carrier 
recovery) 

oo 

f{t) = A- X d, k]u - c k[ ■ g r (t-kT c ) + w(t), (2.49) 

k =-oo 

where w{t) is Additive White Gaussian Noise (AWGN) representing the 
complex envelope of the channel noise affecting the modulated signal. The 
noise process can be expressed as 



w(0 = w /(0 + j w e(0, 



(2.50) 
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where w, (, t ) and w Q ( t ) are two baseband signals representing the in phase 
and quadrature components, respectively, of the AWGN process. Such com- 
ponents are independent, zero mean, white Gaussian random processes with 
two-sided PSD 

<P w ,{f) = <P Wg {f) = N 0 . (2.51) 




C Q.m 



Figure 2-8. Block diagram of a DS/SS receiver. 

After baseband conversion each component of the complex-valued re- 
ceived signal (2.49) is passed through a filter matched to the chip pulse 
shape (Chip Matched Filter, CMF). Assuming Nyquisf s SRRC chip pulse 
shaping, the I and Q receive filters have frequency response given by 



G R {f) = ^G Nyq (f)!T c (2.52) 

The filtered signal is then 



v(t) = A- £ d {% -c K ■ g Nyq {t~kT c ) + n(t ), (2.53) 

£=—oo 

where we have introduced the (chip time) Nyquisf s RC pulse (see (2.52) 
and (2.27)) 



§ Nyq (t) = g T (t)®g R (t) 



(2.54) 
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and we have defined the filtered zero mean Gaussian noise process 

«(f) = w(r)®S*(0 (2.55) 

with ® denoting convolution. The signal output by the filter is then sampled 
at chip rate. In the case of perfect chip clock recovery, the sampling instants 
are t m = mT c and the resulting chip rate digital signal is 

oo 

y,n = y{ f m ) = a 'Yu K) u ■ % ■ g Nyq [{ fn -k)T c ] + h m . (2.56) 

k=—co 

The noise component in (2.56) is defined as 

»*="/,« +j"{^ =«('»,)’ ( 2 - 57 ) 

whose I/Q components (n Im ,n Qm ) are independent identically distributed 
zero mean Gaussian random variables with variance a„ = a~ given by (see 
(2.51)) 

00 00 T.J 

<= {^(/)|G,(/)| 2 d/ = AI 0 {|G,(/)| 2 d/ = ^f. (2.58) 

—00 —00 c 

Recalling the features of the Nyquist’s RC pulse (2.25), we remark that 
g N J(m-k)T c \ is zero for m^k and is 1 when m = k (Nyquist-1 pulse) 
and we are left with 



L » 1 (2.59) 

The receiver generates a replica of the spreading sequence used by the 
transmitter (the co called replica code), and executes the product between 
the chip rate samples y m and the chips of such local code replica to ‘re- 
move’ the code from the received signal. In practice this is equivalent to the 
operation of despreading carried out by conventional analog SS receivers. 
After multiplication (despreading) the samples are accumulated over an M- 
symbol long window and finally downsampled at symbol rate R s to yield the 
decision strobes z k for data detection 




(2.60) 
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Equation (2.60) actually describes a digital integrator over the symbol 
time which performs symbol matched filtering. Substituting (2.59) in (2.60), 
we have 



^ kM+M-l 

z,= V d , , -Ci i -c, i +v, . (2.61) 

M ^ W« 14 14 a v ' 

2Vl m=kM 

In (2.61) we have a noise term 

1 kM+M-\ 

4 =v,,k +jv e ,, = — • X -c (2.62) 

-*** m=kM 

whose l/Q components are independent, identically distributed, zero mean 
Gaussian random variables with variance 



_2 _2 1 \ J rr-2 N 0 N 0 

o, = ct = — --Mo., = = — . 



M 



MT T 



(2.63) 



Recalling the definitions in (2.31), we observe that, for kM < 
m < kM + M - 1 we have {m} M = k , thus 



2, = A ■ d, 



1 IcM+M- 

— y 

M 



C,_, -c, , +v, 



(2.64) 



or 



l k M+M - 

I 



c w, +v. 



(2.65) 



17 m=kM 

But for binary spreading codes 

(2.66) 

and we eventually obtain the following decision variable for data detection 
z k =A-d k +v k . (2.67) 



C| | =1, 

K 



Such a decision strobe is eventually passed to the final detector, which is 
just a sheer to regenerate the transmitted digital data stream if no channel 
coding is used. The cascade of multiplication, accumulation, and downsam- 
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pling produces a ‘sufficient statistics’ which is the same as in a conventional 
matched filter narrowband receiver. Actually, from (2.60) we see that the 
cascade of despreading and accumulation can be seen also as the computa- 
tion of a correlation between the incoming signal (2.56) and the local code 
replica, and the downsampling is aimed at taking the maximum of the corre- 
lation. This is why the detector in Figure 2-8 is also called the Correlation 
Receiver (CR). We remark again that the decision strobe in (2.67) is exactly 
the same we would obtain in the case of a conventional narrowband trans- 
mission of complex-valued symbols d k over AWGN channel and using a 
matched filter receiver. Therefore the spreading and despreading operations, 
carried out in the transmitter and receiver, respectively, are completely 
transparent to the final user. 

The Bit Error Rate (BER) P{e) of a DS/SS transmission over an AWGN 
channel is therefore coincident with that of a conventional narrowband 
transmission employing the same modulation format. For instance, in the 
case of a Binary PSK (BPSK) or a Quadrature PSK (QPSK) we obtain (see 
Figure 2-9) 



P(e) = Q 



2 E u 



N, 



o J 



( 2 . 68 ) 



where E h = P s T h is the average received energy at RF per information bit 
and Q(x) is the Gauss’ integral function 



Q(*)=f 




e 



Vn 



dy. 



(2.69) 



According to the theoretical description outlined above, the detection of a 
DS/SS signal can be accomplished by using the receiver architecture shown 
in Figure 2-8, whose baseband equivalent block diagram is depicted in Fig- 
ure 2-10. It is also apparent that DS/SS communications require more com- 
plex receivers with respect to conventional narrowband signaling [De98b]. 
Suffice it to say that sampling at the filter output must be performed at chip 
(instead of symbol) rate, and this requires much faster signal processing ca- 
pabilities. Furthermore, timing recovery must be carried out with respect to 
the chip (instead of symbol) interval, and this is much more demanding in 
terms of accuracy of synchronization. Finally, as can be inferred from (2.61), 
a proper alignment between the received signal and the local replica of the 
spreading sequence is mandatory in order to successfully perform signal de- 
spreading through a perfectly aligned correlation. 
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From the considerations made above, it is evident that the most peculiar 
and crucial function which the DS/SS receiver has to cope with is timing re- 
covery. The basic difference between the function of symbol timing recovery 
in a conventional modem for narrowband signals and code alignment in a 
wideband SS receiver lies in a fundamental difference in the statistical prop- 
erties of the data bearing signal. In narrowband modulation the data signal 
bears an intrinsic statistical regularity on a symbol interval T. that is, prop- 
erly speaking, it is cyclostationary with period T. . Clock recovery is to be 
carried out with an accuracy of some hundredths of a T s , and is not particu- 
larly troublesome. Owing to the presence of the spreading code, the DS/SS 
signal is cyclostationary with period LT C (in a short code arrangement), but 
the receiver has to derive a timing estimate with an accuracy comparable to a 
tenth of the chip interval T to perform correlation and avoid Inter-Chip In- 
terference (ICI). 




Figure 2-9. BER of a matched-filter receiver for BPSK / QPSK transmission 
over the Gaussian channel. 



This simple discussion suggests that timing estimation becomes more and 
more involved as L gets large (long codes). Unfortunately, in practical 
applications of DS/SS transmissions we always have L » 1 even for short 
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codes (typically L > 3 1 ), so that the problem of signal timing recovery with 
a sufficient accuracy is much more challenging for wideband DS/SS signals 
than for narrowband modulation, and is usually split in the two phases of 
coarse acquisition and fine tracking. The first is activated during receiver 
startup, when the DS/SS demodulator has to find out whether the intended 
user is transmitting, and, in the case in which he/she actually is, coarsely es- 
timate the signal delay to initiate fine chip time tracking and data detection. 
Code tracking is started upon completion of the acquisition phase and aims 
at locating the optimum sampling instant of the chip rate signal to provide 
ICI-free samples (such as (2.59)) to the subsequent digital signal processing 
functions. 




Figure 2-10. Baseband equivalent of a DS/SS receiver. 

After examining the main functions for signal detection, we present some 
introductory considerations about the practical implementation of a DS/SS 
receiver. In this respect Figure 2-11 shows a scheme of a DS/SS receiver 
highlighting also the different signal synchronization functions (carrier fre- 
quency/phase and timing) which often represent the real crux of good mo- 
dem design. We have denoted by A f , 0, and x the estimates of the carrier 
frequency offset, phase offset, and chip timing error, respectively, relevant to 
the useful signal. As already discussed (see Figure 2-8), the baseband I/Q 
components of r{t) are derived via a baseband I/Q converter as the one in 
Figure 2-3. Such a converter is usually implemented at IF in double conver- 
sion receivers or directly at RF in low cost, low power receivers (this is the 
case, for instance, for mobile phones). 

The basic architecture of Figure 2-11 can be entirely implemented via 
DSP components by performing Analog to Digital Conversion (ADC) as 
early as possible, at times directly on the IF (intermediate frequency) signal 
provided at the output of the RF to IF front end conversion stage in the re- 
ceiver. In so doing, the baseband received signal r(t ) in Figure 2-11 is actu- 
ally a sampled digital signal, carrier recovery and chip matched filtering are 
digital, and the ‘sampler’ is just a decimator/interpolator that changes the 
clock rate of the digital signal. The ADC conversion rate of r{t) is, in fact, 
invariably faster than the chip rate to perform chip matched filtering with no 
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aliasing problems. We shall say more about the digital architecture of the 
DS/SS receiver in Chapter 3 when dealing specifically with the MUSIC de- 
modulator. 




Figure 2-11. Architecture of a receiver for DS/SS signals, including synchronization units. 



3. CODE DIVISION MULTIPLEXING AND 
MULTIPLE ACCESS 

In the DS/SS schemes discussed above the data stream generated by an 
information source is transmitted over a wide frequency spectrum using one 
(or two) spreading code(s). Starting from this consideration we can devise an 
access system allowing multiple users to share a common channel transmit- 
ting their data in DS/SS format. This can be achieved by assigning each user 
a different spreading code and allowing all the signals simultaneously access, 
in DS/SS mode, the same frequency spectrum. All the user signals are there- 
fore transmitted at the same time and over the same frequency band , but they 
can nevertheless be identified thanks to the particular spreading code used, 
which is different from one user to another (the so called signature code). 
The users are separated in the code domain, instead of time or frequency 
domain, as in conventional Time or Frequency Division Multiple Access, re- 
spectively (TDMA, FDMA). Such a multiple access technique, based on 
DS/SS transmission, is therefore called Code Division Multiple Access 
(CDMA) and the spreading sequence identifying each user is also referred to 
as signature. The N user signals in DS/SS format can be obtained from a set 
of N tributary channels made available to a single transmitting unit which 
performs spectrum spreading of each of them, followed by Code Division 
Multiplexing (CDM). Alternatively, the DS/SS signals can be originated by 
N spatially separated terminals, and in this latter case code division multi- 
plexing occurs at the receiver antenna. 

Let us focus our attention on the detection of a DS/SS signal in the case 
of a multiuser CDMA system in which N users are concurrently active. For 
the sake of simplicity, we refer once again to the simplified signal model 
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(2.30). We will start considering the case a CDMA multiuser communication 
in which all of the spreading sequences of the different users are synchro- 
nous, i.e., the start epoch is exactly the same for each code. We will refer to 
this arrangement as Synchronous CDMA (S-CDMA). This is the case of a 
CDMA signal originated from a single transmitter, i.e., from a base station 
(or satellite) to a group of mobile receivers. We will therefore address such a 
scenario as single-cell. The received signal, after baseband conversion and 
under the hypothesis of perfect carrier recovery, can be written as 

N 00 ^ 

F (0 = Z^ t ' ) - Z 41 -<4 -grit-kT^ + w^), (2.70) 

1=1 £=- oo 

with the same definitions as in the single-user case described by (2.49), 
where for each user’s channel we have defined the amplitude coefficient (see 
( 2 . 11 )) 

A (i) =^2Pj A] . (2.71) 

Notice that, in order to take the multiple users accessing the RF spectrum 
into account we have introduced the superscript (,) which identifies the am- 
plitude, data, and code chips of the generic i th user. The generation of the 
aggregate code division multiplexed signal (2.70) is conceptually depicted in 
Figure 2-12 

Assuming now, without loss of generality, that the receiver intends to de- 
tect the data transmitted by user 1, we can re-write (2.70) as 

oo 

r{t) = A (l) ■ Z 41 ' C S1 kT c ) 

£=-oo 

N co ^ 

+Z a(,) • Z 41 ' C K • St 0 - kT c ) + w(t) , (2.72) 

/= 2 £=-oo 

where the first term at the right hand side is the useful signal to be detected, 
while the second one, denoted in a more compact form as 

N co 

40 = Z a( ° • Z 41 '4 -gr(t-kT c ) (2.73) 

1=2 £=-oo 

is an additional component owed to multiple access. In a conventional corre- 
lation receiver, the received signal (2.72), is passed through the chip 
matched filter and sampled at chip rate yielding the samples v m (see (2.59)) 
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y =A^-df\ - c, ( 1 1 + V • d\'\ -c^i +n 



= A fI> ■ df\ ■ cf] + b + n . 

Mm K m m ■ 



where b m = b(t m ) is given by 



K = b I , m +]b Qm =Y j A (,] -d 



Mi, Ml 



(2.74) 



(2.75) 




Figure 2-12. Generation of a multiuser S-CDMA signal. 

The I/Q components b I m and b 0 m in (2.75) are independent, identically 
distributed, zero mean random variables whose variance is 

ol=c 2 he =E{bl m }. (2.76) 



The sampled signal (2.74) undergoes correlation (despreading / accumu- 
lation) with the signature code of user 1 as follows 



1 kM+M - 1 

Z y m ' 1 



(2.77) 



After some algebra we find (see (2.61)-(2.64)) 
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(2.78) 



and 



4 !) = ^ (1) • ^ • X c (i) c P) 0) + X • X c (o c m (*) + v* , (2.79) 

i= 2 

where we have defined the following partial auto- and cross-correlations 
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(2.81) 



The decision strobe is eventually passed to the final detector which re- 
generates the transmitted digital data stream of the user 1 (the desired, ‘sing- 
ing’ user). From (2.79) it is apparent that the decision strobe zf is com- 
posed of three terms: i) the useful datum (first term); ii) Gaussian noise 
(third term); and iii) an additional term arising form the concurrent presence 
of multiple users and called Multiple Access Interference (MAI). In particu- 
lar, the MAI term can be expressed as 






or equivalently, according to definition (2.75), as 



1 kM+M - 1 

P*=Pr,*+jP e,*=— I k-c\\ 



(2.82) 



(2.83) 



The l/Q components \ > > lk and $ Qk are independent, identically distrib- 
uted, zero mean random variables whose variance can be put in a form simi- 
lar to (2.63) 
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(2.84) 



where we have introduced an equivalent PSD /„ of the MAI term, as- 
suming implicitly that it can be considered flat (white) over the whole signal 
spectrum. Now re-cast (2.79) into the form 

z®=jP-d ( ? + $ k +v k . (2.85) 

Under certain hypotheses which we will discuss in a little while, the MAI 
contribution can be modeled as an additional (white) Gaussian noise 
(independent of v k ). Therefore the BER performance of the DS/SS signal 
can be analytically derived simply by assuming an equivalent noise term 
v' k = (3 A + v k with a total, equivalent PSD given by 

K=N 0+ I 0 , (2.86) 



and the decision strobe becomes equivalent to that in (2.67), which refers to 
a pure AWGN channel 



« (1) 



■dl"+V k . 



(2.87) 



Consequently the BER for QPSK modulation in the presence of Gaussian 
MAI, can be obtained by a simple modification of expression (2.68) 



P(e) = Q 



'K j 



= Q 



2K 



'N 0 +I 0J 



(2.88) 



If very long pseudo-random (i.e., noise-like) spreading sequences are 
used then the chips c£j of each user code can be approximately modeled as 
independent random variables belonging to the alphabet {-1,+1}. Also, the 
chips of different users can be modeled as uncorrelated random variables. It 
follows that if N » 1 and if all of the signal powers are (almost) equal (i.e., 
P = P , Vi ), then the power of the MAI is P MAI = (N - \ )P and by virtue of 
the central limit theorem, we can model the MAI components P 7 k and P 0 k 
at the detector input as independent identically distributed zero mean Gaus- 
sian random variables with variance (see (2.76) and (2.83)) 
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This situation is actually experienced, for instance, in a CDMA system 
with accurate power control, so that all the users signals are received at (al- 
most) the same power level. Under this hypothesis the PSD of the MAI is 

h = T s {N ~^ Ps ={N-\)P s T c ={N-\)E c = E b , (2.90) 



where E c = P s T c represents the average energy at RF per chip, and according 
to (2.40) we have set E c =E h /G p . The BER (2.88) becomes then 



P{e) = Q 



2E k 



\N 0 +{N-l)E c 



(2.91) 



and with some manipulations we obtain for QPSK 

( \ 



P(e) = Q 



2 £> 
\ N 0 



V 



1 

L (N-l) 2E b 
V M N o 



(2.92) 



From the expressions above it turns out that the MAI degrades the BER 
performance. In particular, the degradation increases with the number of in- 
terfering channels and decreases for large processing gains. Notice also that, 
in the particular case N = 1 (2.92) collapses to the conventional BER ex- 
pression relevant to (narrowband) QPSK modulation over AWGN channel 
and matched filter detection. 

However, we must remark that in the more general case of CDMA 
transmissions with MAI ( N > 1 ) (2.92) is accurate only under certain condi- 
tions. In particular, the assumption of uncorrelated binary random variables 
for the code chips is valid only when the signature codes are ‘long’ in the 
sense of Section 2. As is apparent from (2.82), the amount of MAI is in real- 
ity determined by the cross-correlation properties between the useful signal 
and the interferers. Therefore, in order to derive a more accurate analytical 
expression for the BER the particular type of spreading codes and the rele- 
vant correlation properties must be accounted for. In order to simplify the 
analytical description, from now on we shall focus on the case of short 
spreading codes, i.e., M = n-L. Recalling (2.42), the cross-correlation 
(2.81) is now 
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1 kn L+nL —l 1 kn L+L — 1 

X c „ v „(*) = — • X $ X <|/^K=^(0 C (,,- (2-93) 

m=knL nj - / m=knL 

The variance of the I/Q components of the MAI samples P„, must be re- 
written by resorting to (2.82), yielding 

<=°l,=T = £ p .. R k«. <W0 

A i = 2 

and the PSD of the MAI contribution to the total noise in (2.86) becomes 

h = T st P S, R kv (Z95) 

i '=2 

In the case of equi-powered users we obtain 

(2-96) 



where E s = PT. represents the average energy at RF per modulation symbol. 
Since, for QPSK, R. = R h / 2 , we have E s = 2 E b and therefore 



I 0 = 2E b £Rl 



.(OJi) 



(2.97) 



By retaining the assumption of a Gaussian distribution of the MAI, which 
holds true in the case of large spreading factors and large number of users, 
the BER is now 



P(e) = Q 



2E± 

1 N n 



1 



4 ^ f 



(2.98) 



From the expression above we can conclude that in order to limit the 
detrimental effect of MAI on BER performance the spreading sequences 
must be chosen so as to exhibit the lowest possible cross-correlation level. In 
the case of maximal length sequences with L » 1 , the cross-correlation is 
well approximated by [Sar80] 
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thus, recalling that M = n-L , we obtain 
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P{e) = Q 




V 



1 

l u 2 E b 

M N 0 



(2.99) 



( 2 . 100 ) 



which for n = 1 coincides with the BER expression (2.92) previously de- 
rived for the white Gaussian MAI model. Actually (2.99) represents the 
RMS value of the cross-correlation between two L -period maximal length 
sequences taken over all the possible relative phase shifts. However, it is 
found that, in spite of their many appealing features, m-sequences are not 
convenient for CDMA. First, for a given m there exists only a limited num- 
ber of sequences available for user identification in a CDMA system 
[Din98]. Also, the cross-correlation properties of -sequences are not opti- 
mal, so they result in significant levels of MAI. 

The MAI term (3^ in (2.85) can be canceled by using orthogonal codes 
such as R c (o c (y) =0 ( i ^ j ). A popular set of orthogonal spreading codes is 
represented by the Walsh-Hadamard (WH) sequences [Ahm75], [Din98] 
which have period L = 2'" and are obtained taking the rows (or the columns) 
of the LxL matrix H m recursively defined as follows 



H 



l~H , 


H , 1 




"+1 +f 


m — 1 


m- 1 


, H. = 






" 1 


_+l -1 



( 2 . 101 ) 



where -H m means the complement (i.e., the sign inversion) of each element 
of the matrix H m . From (2.101) it is apparent that for a given period L the 
WH set is composed of L sequences. Thanks to orthogonality the BER per- 
formance for an Orthogonal CDMA (O-CDMA) system is obtained by re- 
moving the MAI contribution in (2.98), which gives the conventional ex- 
pression for narrowband BPSK/QPSK modulation (2.68). 

Despite such an appealing feature, it must be noticed that the WH se- 
quences exhibit very poor off zero auto- and cross-correlation properties 
making difficult initial code acquisition and user recognition by the receiver. 
For this reason, in practical applications pure orthogonal codes such as the 
WH sequences must be used overlaid by a PN sequence [Fon96], [Din98]. 
According to this approach the resulting composite code is therefore the su- 
perposition of two codes, i.e., an orthogonal WH code {cjMra } e {±1} for 
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user identification (the so called traffic or channelization code) plus an over- 
lay PN sequence { c k | PN } 6 {±1} , common to all the users within the same 
cell (or satellite beam), as follows 





( 2 . 102 ) 



The cell- (or beam-) unique overlay code yields a twofold benefit: first, it 
represents a sort of cell (or beam) identifier and second, it performs a ‘ran- 
domization’ of the user signature that is helpful in reducing unwanted off 
zero auto- and cross-correlation peaks. For this reason the overlay code is 
also called scrambling code. It is immediate to observe also that orthogonal- 
ity between any pair of composite sequences is preserved, i.e., Ryyyj) = 0 
(i * j ) still holds true. Finally, notice that if we want to obtain a composite 
code having exactly the same period L = 2"' as the original WH sequence we 
must select an overlay maximal length sequence having period 2"' - 1 , and 
properly extend it by inserting a ‘+1 ’ chip into its longest run. Such a modi- 
fied sequence is called Extended PN (E-PN). The use of orthogonal codes 
(either simple or composite) cancels the MAI term (3 t out of (2.79), yielding 
the very same decision strobe as in the single-user case (2.67). 

Other sets of codes widely used as spreading sequences in practical 
CDMA systems are the quasi-orthogonal ones. These codes have non-null 
(yet small) cross-correlation ( /C, )( , (( i * 0 , for i j ), but exhibit less critical 
off zero correlation performance. For instance, by a proper combination of 
two selected PN sequences with period L = 2"' - 1 , we obtain the Gold se- 
quences [Gol67], [Gol68], [Sar80], [Din98], which have period L, cross- 
correlation R c0)cm =-l / L and small 3-valued off zero cross-correlation. It is 
also found that the number of Gold codes having period L is L + 2 (apart 
from the particular case L = 255 which admits only L + 1 = 256 codes) 
[DeG91]. From (2.98), and recalling that M = n-L, the BER performance 
for a Quasi-Orthogonal CDMA (QO-CDMA) system employing QPSK 
modulation, becomes 



( 






P{e) = Q 
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(2.103) 



and, from (2.97), the PSD of the equivalent noise owed to MAI is 
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Similarly to the PN, Gold sequences can also be extended by proper in- 
sertion of an additional chip into one sequence period in order to obtain a set 
of quasi-orthogonal codes with repetition period L = 2"' which is called Ex- 
tended Gold (E-GOLD). Another set of quasi-orthogonal codes is the Ka- 
sami set [Kas68], [Sar80], [Din98], The first step to obtain a Kasami se- 
quence is decimation of an m-sequence, with m even, by a factor s = 2"' /2 + 1 
(thus obtaining a further m-sequence with period 2"' /2 -1 [Sar80]), and ex- 
tension by repetition (5 times) of the decimated sequence up to the original 
length. The set is then constructed by collecting all of the sequences obtained 
by addition of any cyclical shift of the decimated/extended sequence to the 
original m-sequence, and including the original sequence as well. The total 
number of elements (sequences) in the set is thus 2'" 2 . The cross-correlation 
sequence for two Kasami sequences takes on the three values: -1 , -s , and 
5 - 2 . 

The large set of Kasami sequences consists of sequences of period 
2'" 2 -1, with m even, and contains both a set of Gold (or Gold-like) se- 
quences and the small set of Kasami sequences as subsets. To obtain the Ka- 
sami large set, we start with two equal length m-sequences y and z both ob- 
tained after decimation/repetition of a ‘mother’ longer /« -sequence x as 
above. We then take all of the sequences obtained by adding x, y, and z with 
any possible (cyclical) shifts of v and z, for a total number of 2"' /2 (2"' +1) 
sequences if | m | 4 = 2 , and 2"' /2 (2"' + 1) -1 if | m | 4 = 0 . The auto- and cross- 
correlation sequences are limited to 5 particular values we will not specify 
here (more details can be found in Kasami’s seminal paper [Kas68], in the 
extensive investigation about codes correlation properties by Sarwate and 
Pursley [Sar80] and in the survey on spreading codes for DS-CDMA by Di- 
nan and Jabbari [Din98]). 

Let us now compare, in terms of capacity, the spreading arrangements 
previously discussed in Section 2. We assume a CDMA system with N ac- 
tive users, each transmitting at a bit rate R h , and employing short spreading 
codes with period L and spreading factor M = n-L . Considering, for the 
sake of simplicity, a set of WH codes we then have that L sequences are 
available to the users. System capacity performance is expressed in terms of 
spectral efficiency, defined as 



N-R h 

^ = ~ B m 



(bit/sec)/Hz, 



(2.105) 
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where the SS bandwidth is R (SS) =R C (Nyquist bandwidth). Table 2-1 com- 
pares the different spreading arrangements previously outlined. 

Table 2-1. Comparison among spreading arrangements 



Spreading 

Arrangement 


Constellation 

Symbols 


B (ss) 
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“H 

[bit/s/Hz] 
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BPSK 


n ■ L ■ R b 


n ■ L 


n ■ L 


L 


l/n 


d-RS 


2 x BPSK 


n-LRj2 


n • l/ 2 


n ■ L 


i/2 


1 / n 


CS 


BPSK 


n ■ L ■ R b 


n ■ L 


n ■ L 


i/2 


1/(2 n) 


Q-RS 


QPSK 


"■L-R b / 2 


n- l/i 


n ■ L 


i 


2 / 11 



We notice that the bandwidth occupancy and the processing gain for 
BPSK constellations are twice as those of QPSK (or dual BPSK), while the 
spreading factor is the same for all of the cases. The maximum number of 
active users N is given by the size L of the WH spreading codes set for 
those schemes employing only one sequence per uses, whilst it is half the set 
size for those schemes assigning two different codes (one for the 1 and one 
for the Q stream) to each user. The last column presents the spectral effi- 
ciency evaluated from (2.105) and demonstrates that Q-RS is the most effi- 
cient scheme while CS is the least one. 

In the case of advanced communication systems supporting different 
kinds of services (e.g., voice, video, data), the user bit rates can be variable 
from a few kbit/s up to hundreds of Mbit/s. In these cases the spreading 
scheme will be flexible enough to easily allocate signals with different bit 
rates on the same bandwidth. This can be achieved by maintaining a fixed 
chip rate R c (and therefore a fixed spread spectrum bandwidth B (SS) ) and by 
concurrently varying the spreading factor M according to the bit rate of the 
signal to be transmitted. This should also be done without altering the prop- 
erty of mutual orthogonality outlined above. The solution to this problem is 
the special class of codes named Orthogonal Variable Spreading Factor 
(OSVF) codes [Ada97], [Din98], The OVSF code set is a re-organization of 
the Walsh-Hadamard codes into layers. The codes on each layer, as is 
shown in Figure 2-13, have twice the length of the codes in the layer above. 
In addition the codes are organized in a tree, in which any two ‘children’ 
codes on the layer underneath a ‘parent’ code are generated by repetition, 
and repetition with sign change, respectively. 

The peculiarity of the tree is that any two codes are not only orthogonal 
within each layer (that is just the complete set of the Walsh-Hadamard codes 
of the corresponding length), but they are also orthogonal between layers (af- 
ter extension by repetition of the shorter code), provided that the shorter is 
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not an ancestor of the longer one. As a consequence we can use the shorter 
code for a higher rate transmission with a smaller spreading factor, and the 
longer code for a lower rate transmission with a higher spreading factor (re- 
call that the chip rate is always the same). The two codes will not give rise to 
any channel crosstalk (MAI). 
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Figure 2-13. The OVSF codes tree. 

In the case of the uplink of a wireless cellular system, the DS/SS signals 
within a single cell (or beam) are originated from sparse terminals which 
have different signature epochs, thus resulting in Asynchronous CDMA (A- 
CDMA). The received signal, after baseband conversion and under the hy- 
pothesis of perfect timing and carrier recovery for the user 1 (i.e., the desired 
one) can be obtained by modifying (2.72) as follows 

r(t) = A (i) ■ £ df k \ u • eg • g T (t - kT c ) 

k=- oo 

N oo 

+X A {,] • e ~ J,2,tA/ ''* 0 ' 1 jr JJg . c g • g T (t - kT c -x,.) + w(t) , (2. 106) 

1=2 k=-<x> 

where x ( . , 9, and A[ represent the timing, carrier phase, and carrier fre- 
quency offsets, respectively, of the rth interfering channel with respect to the 
useful one. Notice also that, differently from the downlink described by 
(2.72), the interfering signal powers P s 0) in the uplink described by (2.106) 
are, in general, unequal. This is owed to the different propagation loss ex- 
perienced by each user signal originating from a different spatial location, 
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which can be only partially compensated for by means of a power control 
algorithm. Such a power unbalance is expressed by the useful to single inter- 
ferer power ratio , defined as follows 

C/l\. =pW/P s (,) , (2.107) 



and the amplitude of the interfering signals (2.71) becomes 
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In this case the CDMA signal model (2.106) is 
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(2.108) 
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(2.109) 



In order to simplify system description and/or analysis we assume an 
MAI model with equi-powered interfering users, P, = P; 1 ' , Vi > 2 . We can 
therefore resort to a unique C!I ratio, defined as 



Cl I = I P, ■ 



(2.110) 



The total amount of MAI affecting the useful signal can be expressed by 
means of the following useful to total interfering power ratio 
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tot 1 MAI 



( 2 . 111 ) 



which in the case of equi-powered interferers becomes (see (2.110)) 
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It is fairly apparent that asynchronous access does not allow MAI cancel- 
lation by using orthogonal sequences. In this case the decision strobe can be 
still expressed by (2.85) in which the statistics of the MAI term (3 t are de- 
termined by cross-correlation properties and other parameters of the interfer- 
ing signals. In a first approximation, assuming long spreading codes and 
ideal power control, the BER performance of the link can be computed via 
the Gaussian approximation (2.92). 



4. MULTI-CELL OR MULTI-BEAM CDMA 

As outlined above, multiple access can be granted with DS/SS signals by 
assigning different spreading codes to different users. This can be done both 
in the downlink of a terrestrial radio network (base to mobile) with synchro- 
nous orthogonal codes, and in the uplink (mobile to base) with asynchro- 
nous, pseudo-noise codes. But a problem arises when we run out of codes, 
and more users ask to access the network. With reasonable spreading factors 
(up to 256), the number of concurrently active channels is too low to serve a 
large users population like we have in a large metropolitan area, or a vast 
suburban area. This also applies to conventional FDMA or TDMA radio 
networks where the number of channels is equal to the number of carriers in 
the allocated bandwidth or the number of time slots in a frame, respectively. 
The solution to this issue lies in the notion of cellular network with fre- 
quency re-use as outlined in Section 2 of Chapter 1 . Of course, frequency re- 
use has an impact on the overall network efficiency in terms of users/cell (or 
users/km 2 ) since the number of channels allocated to each cell is a fraction 
1 IQ of the overall channels allocated to the provider, where Q is the fre- 
quency re-use factor (the number of cells in a cluster). The same concept of 
coverage area partitioning with channel re-use applies to multi-beam satellite 
networks as those envisaged in Section 3 of Chapter 1. So to both kind of ra- 
dio networks the technique of universal frequency re-use with CDMA sig- 
nals (Section 2 in Chapter 1) is applicable as well. Focusing on the 
downlink, universal frequency re-use means that the same carrier frequency 
is used in each cell/beam (Figure 1-5), and that the same orthogonal codes 
set (i.e., the same channels) are used within each cell/beam on the same car- 
rier. Of course, something has to be done to prevent neighboring users at the 
edge of two adjacent cells/beams and using the same WH code to heavily in- 
terfere with each other. The trick consists in using a different scrambling 
code on different cells/beams to cover the channelization WH codes as in 
(2.102). In a sense, we use a sort of code re-use technique, where code refers 
to the (orthogonal) channelization codes in each cell/beam. 
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Let us focus our attention on the detection of a DS/SS signal in the case of 
a multi-cell (or -beam) multiuser CDMA system made of H cells (or 
beams) with radius R , whereby N users are simultaneously active within 
each cell (or beam). For explanatory purposes, in the following we will refer 
to a cellular mobile radio network, like that depicted in Figure 2-14, with 
H =7 hexagonal shaped cells, whereby a Base Station (BS) is placed at the 
center of each cell. 






Figure 2-14. Geometry of a cellular network. 

We start our analysis with the downlink. Each BS transmits a CDMA sig- 
nal made of N traffic channels with synchronous orthogonal spreading so 
that the resulting multiuser traffic signal originated from each cell is similar 
to that in (2.70). Notice however that every BS assigns the same power to all 
of the signals. The universal frequency re-use causes the Mobile Terminal 
(MT) located inside cell 1 (the reference cell) to receive II multiplex sig- 
nals in S-CDMA format arriving from all the BSs of the network. We re- 
mark that owing to different propagation times and lack of synchronization 
among the BSs, the overall signal received by the MT is made of an asyn- 
chronous combination of the H multiplex signal from the BSs 
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k =-oo 

where a two-index notation is used for each traffic signal to denote both 
the spreading code (index i) and the cell/beam (index h ). We also denoted 
with /l (/,i the amplitude of the traffic channel received from the generic /?th 
cell (see definition (2.71)), and with i h , 0 /; , and A/),, the timing, carrier 
phase, and carrier frequency offsets, respectively, of the CDMA multiplex 
from cell h with respect to that of the signal received from the reference cell 
(cell 1). We have then ^=0, 0j = 0 , and A/J=0. We can decompose 
(2.113) as 
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where the first term at the right hand side represents the useful traffic signal, 
the second one the intra-cell MAI 
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and the third one the inter-cell MAI 
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(2.116) 



Applying the same kind of processing as discussed in (2.72)-(2.85) we 
obtain the samples at the chip matched filter output from channel 1 (see 
(2.74)) 
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K+iiT" 1 (2-117) 

where £j‘ ntra) = b {mtri) (t m ) and b,J‘ nter) = b ( ' nlcr> (t m ) . Similarly the decision vari- 
able is (see (2.85)) 

4 U) = A {1) ■ d ( " ] + pl iMra) + p[ inter) + v t , (2.118) 

where the terms pj m,ra) and pj mter) represent the intra- and inter-cell MAI, re- 
spectively (see (2.82)). The use of orthogonal spreading codes eliminates the 
effect of intra-cell MAI ( p‘ llllra) = 0) and the detection strobe simplifies then 
to 



;(U) 



= A 



(1) -J| u) +p< in,er) 



+ v. 



(2.119) 



Furthermore, in the case of long pseudo-random spreading codes and 
large number of active users, the inter-cell MAI contribution can be modeled 
as a complex Gaussian random variable p' mter) = p ( / "“ cr) + jP^T' whose 1/Q 
components are independent, identically distributed, zero mean Gaussian 
random variables with variance (see (2.84)) 



a' 







(2.120) 



where we introduced the PSD of the inter-cell CDMA interference /< mter) . 
Similarly to (2.89) we obtain 

= a’ = /f er) It, = P^/M (2.121) 

P/ P Q 



where the power of the inter-cell MAI is given by 

P^T ] =tt P * h) = N • it P s H) (2.122) 

h=2 i=l h=2 

and the signal powers Pf ] are related to the received signal amplitudes A h 
as in (2.71). In a typical urban environment it is found that the power of a 
radio signal decays with the distance from the source according to the fol- 
lowing law [Sei91] 



p l h) =K/dt, 



(2.123) 
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where d h represents the distance between the /zth BS and the MT, whiled 
is a constant factor depending on the transmitter power level, antennas gains 
and carrier frequency, which can be therefore assumed equal for all of the 
signals. 

The exponent t, is the so called path loss exponent, and it is found to as- 
sume values in the range 2-^8, depending on the kind of propagation envi- 
ronment [Lee93]. A typical value for urban areas is C, = 4 . 

In the case of an user located at distance d x from the reference BS as in 
Figure 2-14, we find the following distances [Gia97] measured with respect 
to the BSs of the surrounding cells, and expressed as a function of d 1 

d 2 (rf, ) = d n (4, ) = ^3R 2 + d 2 — 3 Rd x , 

d 3 (d x ) = d 6 (d { ) = y]3R 2 +dt, (2.124) 

d 4 (^d i ) = d$ (d x ) = yj3R + d^ + 3jRd^ , 

and the received power levels become 

Ps 2) =P? ) =K/ 4, 

P®=I* 6) = K /4, ( 2 . 125 ) 

p(4) = p(S) =K /dl 

The MAI power (2.122) is 

Pj^ l] = 2 N ■ (if > + if* + P } 4) ) (2.126) 



and the useful to total interfering power ratio (2.1 1 1) is 

C P®_ Jf 

I m PuT ] 2 N ■ (if ] + if* + > ) ’ 



(2.127) 



The BER in the presence of inter-cell MAI is obtained from (2.88), by 
simply substituting I 0 with / ( , ) " ltL ’ 1 ’ 1 as in (2.121). Combining (2.122) with 
(2.125) we obtain the following BER 
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(2.128) 



which depends on the distance d x of the MT with respect to the reference 
BS. We evaluate first the error probability (2.128) for a MT located in the 
Worst Case (WC) user location, i.e., in the farthest point from the reference 
BS (see Figure 2-14). A user placed in such a location receives the minimum 
power of the useful signal from the reference BS (no. 1) and the maximum 
of interference from the closest interfering BSs (no. 2 and no. 7). Letting 
d t =R, with some geometry, the 'BS to MT’ distances d h (2.124) are found 
to be 



1^2 d 

d 3 =d 6 = 2R, (2.129) 

d 4 = d 5 = yJlR. 



If we are interested in a less pessimistic case, or in a sort of Average Case 
(AC), we need a statistical model for the spatial distribution of the MTs 
within the reference cell. A reasonable assumption consists in considering all 
the locations within the cell as equally probable. Toggling now, for the sake 
of simplicity, to a circular cell model such as that in Figure 2-15, the 
probability that the MT lies within any region of area S all within the cell is 
given by 

P(S) = -^. (2.130) 



Also, the probability that the MT is located at a distance x ( 0 < x < R ) 
from the BS is given by the probability that it lies inside the circular corona 
having infinitesimal width dx and radius x, represented by the grey shaded 
region in Figure 2-15. Such probability is given by 



, „ dS 2tix dx 2x , 

di 3 = r- = 7— = — dx . 
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(2.131) 
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The probability density function of the random variable X representing 
the distance from the BS to the MT is then 

ap j x 

n ( x ) = _ = -i 0 <x<R. (2.132) 

xy ’ dx R 2 

Finally, we compute the mean value of the distance 

r lx = E {^}= £x/> x (x)dx= j* ^-dx = |i?. (2133) 

Letting d x =r\ x = 2R/3 , we can now evaluate the BER (2.128) for the 
AC user location. With some geometry the ‘BS to MT’ distances d h (2.124) 
are now found to be 

d 2 =d 7 = y/5 R / 2, 

d 3 =d 6 = yfilR / 2, (2.134) 

d 4 =d 5 = 1R/2. 




Figure 2-15. Circular cell model. 

The uplink is typically based upon asynchronous random access from 
MTs to the BSs, and therefore any receiving BS experiences both intra- and 
inter-cell asynchronous MAI. The overall signal received by the reference 
BS in the uplink is then made of an asynchronous combination of the signals 
originated from all of the MTs active users within all the cells and can be put 
in a form similar to (2.106) 
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(2.135) 



Notice that owing to the lack of synchronicity amongst all the transmit- 
ters and to the different user positions, any MT contribution is characterized 
by a different amplitude A 0,h) , timing x i h , carrier phase 9, , and carrier fre- 
quency A f i h offsets. We can decompose (2.135) as 






N 

I ■ 

i=2 


-j(2»v; !(+0, j) y J(C,1) 
^ l A 1 « 

fc =—ao 


H N 

YL* 

h= 2 i"=l 


i,h) . e “j( ^ Ji 
k=- oo 






+w(t). 



(2.136) 



where the first term in the right hand side represents the useful traffic signal, 
the second one the intra-cell MAI 
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and the third one the inter-cell MAI 

£(inter) • e “ j( 2 "^^' +e a) 

h = 2 i=l 

00 



(2.138) 



As is shown in Section 5 below, the MAI terms can at times overwhelm 
the useful signal component. To prevent this, modern CDMA systems im- 
plement some form of power control, so that all the signals originated from 
the MTs located within the generic /ith cell are received with same ampli- 
tude, say A (h) , by the relevant BS. The intra-cell contribution then becomes 

£<in,ra) = £ jd) . g ^ _ /y _ ^ ) (2 . 139) 
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By applying a procedure similar to that described by (2.72)-(2.85), and 
referring to the useful traffic channel, represented by user 1 of cell 1, we can 
derive the following expression for the decision strobe (which is formally 
identical to (2.118)) 

4 U) = A (l) • J< u) + p< in,ra) + p<“ ,er) + v k , (2. 140) 

where the samples pj t mtra) and p* in,er) represent again the intra- and inter-cell 
MAI residual disturbance, respectively. Owing to the asynchronous random 
access from MTs to the BSs adopted in the uplink, intra-cell orthogonality 
can no longer be invoked, and, differently from (2.119), P* mtra) is not null. 

Let us start by considering for now the issue of an uplink affected by in- 
tra-cell interference only, as in the case of a single-cell scenario. In the usual 
case of long codes and large number of active users, the inta-cell contribu- 
tion can be modeled as a complex Gaussian random variable denoted as 
p(mtra) _ ptmtni) + j p^ a) whose I/Q components are independent identically 
distributed zero mean Gaussian random variables with variance (see (2.84)) 
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(2.141) 



where / ( l l " ltra, is the PSD of the intra-cell CDMA interference. Similarly to 
(2.89) we obtain 
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(2.142) 



where is the power of the intra-cell MAI, which, in the case of perfect 

power control, is given by 



^=±^^-1)^, (2.143) 

i=2 

and the signal power P (,) is related to the received signal amplitude A, as in 
(2.71). The useful to intra-cell interfering power ratio (2.11 1) is now 
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(2.144) 
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The BER for the uplink of a single-cell case, i.e., in the presence of intra- 
cell MAI only, is obtained from (2.88) by simply substituting I 0 with / ( ( ) "" r:ii 
as in (2.142-143) 
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M N 0 



1) 2 E h 






P{e) = Q 



1 N n 



Let us consider now the extension of the analysis above to the more gen- 
eral case of an uplink in a multi-cell scenario. It must be remarked that 
power control makes the interfering power received at the reference BS from 
the surrounding cells to be dependent on the random distance between the 
interfering MT and the BS serving the cell containing that particular MT. 
The issue of MAI evaluation in the uplink of a multi-cell network gets rather 
involved and will not be presented here [New94], We will just remark that 
the outcome of such investigation is the evaluation of a coefficient, the inter- 
cell interference factor , defined as 



<P = 



p( inter) 
r MAi 
p(intra) 
-*MAI 



(2.146) 



so that the total MAI experienced by the reference BS can be expressed as 

p = p( i"“) + p(inter) _( 1 + O \/H\ 

1 MAI 1 MAI T 1 MAI t 1 ^ t J 1 MAI • ' ) 

The value of tp depends on the path loss exponent C as shown in Figure 
2-16. For C, = 4 and in the presence of perfect power control, we have 
cp = 0.55 [New94], [Vit95], From (2.143) we obtain 

P UM ={l + y){N-l)P?\ (2.148) 



and the BER is obtained by a simple modification of (2.145) 
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Figure 2-16. Inter-cell interference factor vs. path loss exponent 

Finally, Figure 2-17 presents the BER performance relevant to some 
downlink and uplink configurations, evaluated for L = 64 , E b i N 0 = 9.6 dB 
and C, = 4 . Crosses and triangles refer to the downlink for the worst and av- 
erage case, respectively, and have been plotted by using (2.128) with the set- 
tings (2.129) for the WC and (2.134) for the AC. Dots and squares refer to 
the uplink, under the hypothesis of perfect power control, for the single and 
multi-cell scenario respectively and were derived using (2.145) and (2.149) 
with ip = 0.55 . 



5. INTERFERENCE MITIGATION RECEIVERS 
FOR THE DOWNLINK 

The discussion in the previous Section has outlined the main issue that 
affects CDMA systems as far as capacity and/or quality of service are con- 
cerned, namely, interference. This is certainly true for the asynchronous up- 
link wherein interference in the form of MAI is generated within one’s own 
cell (intra-cell interference), but applies to the downlink as well, where asyn- 
chronous, non-orthogonal inter-cell interference is generated by adjacent 
cells operating on the same carrier frequency in the same coverage area. 
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Figure 2-17. BER vs. cell loading for downlink and uplink configurations 

The seminal work by Verdu [Ver86] and almost two decades of related 
research have shown how to cope with such issues [Due95], [Mos96], 
[Las97], [Ver97], [Hon98], MAI in the uplink can be counteracted by the 
adoption at the BS of a suited joint or Multiuser Data Detection scheme 
(MUD) that jointly performs demodulation of all of the uplink data streams 
in a single, centralized signal processing unit. With centralized detection the 
presence of a certain amount of MAI coming from interfering channels can 
be accounted for when demodulating the useful, intended channel. This 
standpoint applies reciprocally to all of the channels, leading to the above- 
mentioned notion of ‘multiuser’ demodulation. To be more specific, a con- 
ventional arrangement of the BS channel demodulator is shown in Fig 2-18. 
The data demodulation unit is just architected as an array of independent 
single-channel demodulators in the form of conventional matched filter de- 
tectors, like that depicted in Figure 2-10. Those detectors are optimum in the 
AWGN environment - they simply ignore the issue of MAI and lead to 
strong sub-optimality. 

In contrast, a multiuser detector is a centralized data demodulation unit 
whose general scheme is depicted in Figure 2-19. By concurrently observing 
all of the matched filter outputs, MAI can be taken into account when detect- 
ing data of each channel, and it can be mitigated or ‘cancelled’ by suitable 
signal processing. This ‘interference cancellation’ feature of MUD relaxes 
the demand on the cross-correlation properties of the user signature se- 
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quences since good BER performance can be attained even in the presence 
of non-orthogonal, asynchronous MAI. This gives a new insight to the issue 
of CDMA detection, in that it suggests a way of increasing the capacity of 
the CDMA system while keeping the quality of the link (i.e., its BER) con- 
stant. This two-fold leap forward in the performance of the CDMA receiver, 
not surprisingly, comes at the expense of a substantial increase in the com- 
plexity (signal processing power) of the demodulator. The front end of the 
centralized MUD scheme in Figure 2-19 consists of the same bank of N cor- 
relators with the individual N user signature waveforms that we find in the 
multiple conventional CRs in Figure 2-18. The real core of MUD lies in the 
elaborate post-processing of the array of matched filter outputs (sufficient 
statistics) [Mos96]. 




Figure 2-18. Bank of conventional single-user detectors in the uplink 

The optimum AWGN MUD scheme originally proposed and analyzed by 
Verdu [Ver86] encompasses a Viterbi Algorithm (VA) to perform parallel 
maximum likelihood sequence estimation. Since the number of states in the 
trellis of the VA is equal to 2 N , a receiver with this optimum structure has a 
complexity that is exponential in the number of users N, and therefore it does 
not easily lend itself to practical implementations. Several different sub- 
optimum MUD structures have therefore appeared since then, all of them at- 
tempting to reduce the estimation complexity by replacing the Viterbi de- 
coder by a different device (decorrelating detector, successive cancellations, 
and so on [Mos96]). 

A further constraint which has to be satisfied to allow implementation of 
MUD is that the CDMA signals to be used in the multiple access network 
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bear short codes, i.e., L = M , so the cross-correlations between the different 
spreading codes are well defined and possibly different from each other. 
MUD in its different forms relies in fact on the cvclostationarity on a code 
period of CDMA signals, that cannot be exploited in the case of long spread- 
ing codes, as is unfortunately the case of the IS-95 uplink. The long code is 
introduced in IS-95 to randomize the user signals and to make MAI look as 
much white and Gaussian as possible (with complete destruction of cyclosta- 
tionarity of MAI). MAI is then dealt with by means of powerful low rate er- 
ror correcting codes that tame out the influence of interference. This is a 
radically different ‘CDMA philosophy’ [Vem96] with respect to short codes 
and MUD. 




Figure 2-19. General scheme of a multiuser detector. 

To give a hint of how the MUD receiver work, let us assume that we in- 
tend to demodulate a three-channel CDMA signal with real spreading of 
real-valued symbols. To simplify matters we will also assume that the three 
channels (codes) are synchronous, but they are not orthogonal. Although this 
is not fully representative of the situation encountered in the uplink (asyn- 
chronous non-orthogonal), it leads to similar results. Assuming ideal code 
timing synchronization, the symbol rate sampled outputs of the code 
matched filters in the front end of the MUD in Figure 2-4 are as follows 
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where p mk = p km is the shorthand notation for cross-correlation coefficient 
R „ ?v , between the two spreading codes of channel m and channel k, A U ) 
is the amplitude of signal on channel i , df is the Mi data symbol on chan- 
nel i , and vj° is the rth noise component. The effect of MAI is apparent, 
and it is also clear that it can be potentially destructive. Assume for instance 
that pj 2 ^ 0 (codes 1 and 2 are not orthogonal) and that A' 2 ' » A n> : the 
MAI term p n A <2) d k 2> in (2.150) may overwhelm the useful term A <l> d k > for 
data detection of channel 1. This phenomenon is called the near-far effect : 
user 2 can be considered as located near the receiver in the BS, thus received 
with a large amplitude, whilst user 1 (the one we intend to demodulate) is the 
far user and is weaker than user 2. Generalizing to N users, equations 
(2.150) can be easily cast into a simple matrix form. If we arrange the cross- 
correlation coefficients p, k into the correlation matrix 
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(2.151) 



and if we also introduce the diagonal matrix of the user amplitudes 
A = diag|^ (1) ,^ t2) ,...,^ liV) } , we have 

z = ARd + v (2.152) 

where the column vectors z , d , and v simply collect the respective sam- 
ples of received signal, data, and noise. A simple multiuser detector is the 
decorrelating detector that applies a linear transformation to vector z to 
provide N ‘soft’ decision variables relevant to the N data bits to be esti- 
mated. Collecting such N decision variables g ( f into vector g , the linear 
joint transformation on the matched filter output vector z is just 

g = R z (2.153) 

where R is the decorrelating matrix, so that 

g = R 1 (ARd + v) = Ad + R v = 

= diag [A l d ( £ ) ,A 2 df\...,A N d { ? ) } + V (2.154) 



We have thus 




2. Basics of CDMA for Wireless Communications 



69 



g = [g«,gf ) ,-,gf ) J (2-155) 

where the superscript T denotes transposition, gf = A (,) d k ) + v( (,) , and v( (,) 
is just a noise component. It is apparent that the MAI has been completely 
cancelled (provided that the correlation matrix is invertible) or, in other 
words, the different channels have been decorrelated. The drawback is an 
effect of noise enhancement owed to the application of the decorrelating ma- 
trix R : the variance of the noise components in v' is in general larger than 
that the components in v . Therefore, the decorrelating detector works fine 
only when the MAI is largely dominant over noise. 

A different approach is pursued in the design of the Minimum Mean 
Square Error (MMSE) multiuser detector: the linear transformation is now 
with a generic NxN matrix Z whose components are such that the MMSE 
between the soft output decision variables in g and the vector of the data 
symbols is minimized 

g = Zz , with Z such that E ||g -d|'j = min , (2.156) 

where E{-} denotes statistical expectation. Solving for Z we have 

Z = (r + g(A- 2 )~\ (2.157) 

with ct 2 indicating the variance of the noise components in (2.150). The 
MMSE detector tries to optimize the linear transformation both with respect 
to MAI and to noise. If noise is negligible with respect to MAI, matrix 
(2.157) collapses into the decorrelating matrix R . Vice versa, if the MAI is 
negligible, the matrix Z is diagonal and collapses just into a set of scaling 
factors on the matched filters output that do not affect data decisions at all 
(and in fact in the absence of MAI, the outputs of the matched filters are the 
optimum decision variables without any need of further processing). 

From this short discussion about MUD it is clear that in general such 
techniques are quite challenging to implement, either because they require 
non-negligible processing power (for instance, to invert the decorrelating or 
the MMSE matrices), and because they also call for a priori knowledge or 
real time estimation of signal parameters, such as the correlation matrix. But 
the potential performance gain of MUD had also an impact on the 
standardization of 3G systems (UMTS in Europe), in that an option for short 
codes in the downlink was introduced just to allow for the application of 
such techniques in the BS [Ada98], [Dah98], [Oja98], [Pra98], 
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How can MUD or related techniques be applied to the downlink of the 
wireless system? Multiuser detection in the user terminal (the mobile phone) 
has no meaning at all, since the UT is by definition a single-channel de- 
modulator. Also, if channel equalization is good, hence channel distortions 
are negligible, no MAI is experienced in the downlink, since the channeliza- 
tion codes are orthogonal. Nonetheless, the downlink experiences inter-cell 
interference, especially when the UT is close to a neighboring cell boundary. 
Therefore a single-channel Interference Mitigating Detector (IMD) is some- 
thing the downlink would surely benefit from. Our previous consideration 
about the two-sided effect of interference mitigation applies to the downlink 
as well: the IMD can be used either to improve the quality of the link for a 
given level of interference, or it can be used as an instrument to increase 
network capacity for a given quality of the link. 

How can we implement an IMD? We start by recalling the signal samples 
at the chip matched filter output (see (2.117)) 

y. = ^ ■ <£> + if 1 + + ». (2.158) 

where ^' ntra) and h]' ntral denote the /nth sample of the intra- and inter-cell in- 
terference term, respectively. The two terms together make up a disturbance 
term that is independent of the useful signal component and adds up to the 
background noise. Also, the total disturbance term D m =£h ntra ) + jj( inter ) j s nQ f 
white as, in contrast, h m is: the standard CR with the code matched filter (or 
the despreader-accumulator cascade) is no longer optimum. It makes sense 
therefore correlating the received signal samples with a set of coefficients 
that is not equal to the values of the spreading code c f . The decision vari- 
able for the Mi data symbol on channel 1 will be thus equal to 

(2159) 

L m= 0 

where the coefficients h m (from now on we will omit the superscript (1) for 
simplicity) have to be designed according to a suited optimization rule. 
Equation (2.159) can also be interpreted as the response to the input y m of a 
linear filter whose coefficients are just h m , downsampled to the symbol rate. 
The simplest yet most effective criterion to design the filter coefficients is 
again the MMSE rule, of course this time in a simplified single-channel ver- 
sion with respect to (2.156) 
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(2.160) 



We have to face an issue similar to that encountered in the MMSE MUD; 
specifically, how to set the filter coefficients in order to solve the minimiza- 
tion problem just stated. The solution of this issue leads to the concept of an 
adaptive detector, whose coefficients are adapted in real time so as to mini- 
mize the mean square error and thus build ‘on the fly’ an optimum linear de- 
tector for the configuration of interference that the reference channel is ex- 
periencing. We skip the detailed solution of the minimization problem 
[Mad94] to report here the recursive equation that, starting from arbitrary 
values of the filter coefficients, allows the synthesis of the optimum detector 
configuration 

h(k + \) = h(k)-y(z k -d { ^y(k)\ (2.161) 



where the /.-dimensional vectors h(/c ) and y(k) group the filter coefficients 
at time k and the received signal samples y kL+m , m = 0 , 1 ,...,/, - 1 , respec- 
tively. In (2.161) y is the recursion step size, to be set a compromise be- 
tween fast acquisition (large y ) and small steady state fluctuations (small 
y). The drawback of the adaptive MMSE detector (2.161) is that recursive 
adaptation of the coefficient vector h (k) calls for the knowledge of the 
transmitted data df to compute the error e k = z k - dy . This Data Aided 
(DA) approach can be adopted if a set of pilot symbols is organized into a 
preamble known to the receiver in an initial training phase. At the end of the 
training phase, the detector coefficients are ‘frozen’ and true data detection 
starts. The detector operates thus in Decision Directed (DD) mode. If MAI is 
time varying, adaptation of the coefficients must be periodically carried out 
each time a new preamble of known data appears in the signal framing. Of 
course, this has an impact on the efficiency of the communication link, since 
the preamble data does not convey any information, and contribute to the 
overall data framing overhead. In addition, the recursion (2.161) may require 
a large number of training symbols to attain a steady state condition (long 
acquisition time), making the adoption of a data aided approach impractical. 

Therefore it makes sense to revert to a blind approach that does not re- 
quire the insertion of any pilot symbols or preambles in the data stream. This 
privileges framing efficiency and is also robust in terms of acquisi- 
tion/reacquisition capability. The criterion to be adopted to satisfy this re- 
quirement is the minimization of the Mean Output Energy ( Minimum MOE, 
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MMOE) of the detector instead of the minimization of the squared error as 
before 



1 L-\ . . 

~ z k = . V-A-i such that e{|z,| 2 J = min . (2.162) 



The rationale behind this criterion is that by minimizing the output en- 
ergy the influence of MAI is minimized as well. Of course, we have to add 
some additional constraints to this minimization problem, otherwise the so- 
lution is a trivial one: all coefficients are equal to 0. The trick to avoid the 
coefficients array h(/c ) collapsing to 0 is the anchoring of its value to the 
value it would have in the absence of interference. We know that with no 
MAI the optimum receiver is the conventional correlator, so that in those 
conditions h(/c ) = c , where c is the /.-dimensional array containing the code 
chips c i ( i = 0, . . . L - 1 ) of the desired user. In the general case we set 

h(£) = C + x(&) (2.163) 

where the constraint is that c and x(Ar) be orthogonal. c T \(k ) = 0 (the su- 
perscript T denotes matrix transposition). This is what we called the ‘anchor- 
ing’, and this is also what prevents the coefficients from converging towards 
0. This simple idea, which was introduced by Honig, Madhow, and Verdu 
[Hon95], led to the development of what is called the Extended, Complex- 
valued, Blind, Anchored, Interference mitigating Detector (EC -BALD). De- 
sign of the detector (and adaptivity of the detector as well) is now transferred 
to design and adaptivity of the code orthogonal vector x. 

In a sense, decomposition (2.163) can lead us to interpret the MMOE de- 
tector as the superposition of two detectors: the one characterized by the set 
of coefficients c is the conventional detector which is optimum for the 
AWGN channel. The other ‘additional’ detector x gives the additional fea- 
ture of interference mitigation. It can be shown [Hon95] that the MMOE so- 
lution for x gives also the MMSE solution for h, i.e., h MMSE = c + x M0E . The 
resulting recursive equation for the vector x is 



x(& + l) = x(£)- y z{k) 



y{ k ) 



y(^£cl 

L 



(2.164) 



The second term between brackets is the orthogonal projection of the vec- 
tor of received samples onto the spreading code c. So the EC-BAID is a 
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modified MMOE linear detector operating on the received signal, sampled at 
the chip rate, v„, to yield the symbol rate signal z k as follows 

~z k =jW(k) T f(k), (2.165) 

where y"(k) is the extended 3 /.-dimensional array of the received signal 
samples coming from three symbol periods (i.e., the current period, the lead- 
ing period, and the trailing period) whose elements are denoted as y. 



y e ( k ) 



y-ii k ) 
y 0 ( k ) 
yi( k ). 



and yfk) = [y (k+i)L ,y (k+i)L+l 



‘ y (k+i)L+L-\ 



(2.166) 



and h e (k) is a similarly extended array of detector coefficients. It is appar- 
ent that extension refers to lengthening of the observation window of the 
signal. Such extension is beneficial in terms of the interference rejection ca- 
pability of the detector, especially for asynchronous MAI. The extended de- 
tector can be effectively implemented for real time operation according to 
the three-fold parallel architecture sketched in Figure 2-20, wherein the first 
unit processes the (A- 1 )th, the Mi and the (k + 1 )th symbol periods for the 
detection of the Mi symbol, the second unit processes the /cth, the (k+\ )th 
and the (k+ 2)th periods, for the detection of the (k + 1 )th symbol, and the third 
unit processes the (Ml)th, the (k+ 2)th and the (&+3)th periods, for the detec- 
tion of the (&+2)th symbol. Each detector unit has the structure outlined in 
Figure 2-21. The final soft output data stream is obtained by sequentially se- 
lecting one of the three detector outputs at the symbol rate 1 !T S by means of 
a multiplexer. To better explain operations of the EC-BAID circuit in Figure 
2-20 it is expedient to introduce a further clock reference ticking at what we 
call the Super-Symbol (SS) rate R ss = 1/(3 A), i.e., once every three symbols. 
R ss is basically the operating rate of each of the three detectors. The output of 
the nth detector unit ( n = 1 , 2 , 3 ) is computed as 



f(3s + n- 1) = ~^ en ( 5 ) r y e (35 + n- 1) , 



(2.167) 



with 5 running at super-symbol rate. To achieve blind adaptation, the com- 
plex detector coefficients are anchored to the user signature sequence as out- 
lined above. 
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Figure 2-20. EC-BAID General Architecture. 

Specifically, the extended detectors are characterized by the set of coeffi- 
cients 

'ol h'-M 

h e '"(s) = c e + x e ’"(s), c e = c,x e ’"(s)= x'-(s) , (2.168) 

_oJ [x;(s)_ 

with the ‘anchor’ constraint c 7 x" = 0 , i = -1,0,1, n = 1,2,3 . By trivially 
generalizing the recursive equation (2.164) we obtain the updating rule for 
the interference-mitigating vectors of the three detectors 

x f -"(s + l) = x e> "(s)-ye f -"(s), (2.169) 

with 5 ticking at the super-symbol rate, and where 

>,W 

e"M= e;(s) , 

, , . , . , , . y* (3s + n -l) r c 

e ,.(s) = z(s) y f (3s + 77-1) c , i 



(2.170) 

= - 1 , 0 , 1 , 
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y is the adaptation step and the asterisk denotes complex conjugation. Equa- 
tion (2.169) implicitly assumes that the three detector units are running inde- 
pendently. More architectures can be devised wherein the error control sig- 
nal for the update of vectors x e ’" , whose elements are denoted as x‘ , is 
unique and is obtained as a combination of the partial errors e e [RomOO], 




Figure 2-21. Internal structure of the three detectors in Figure 2-20. 



An example of the interference mitigation capability of the EC-BA1D is 
given in Figure 2-22. We show in the chart the BER of a CDMA receiver 
with asynchronous interference as a function of the number of concurrently 
active users. The spreading factor is 64, the spreading codes are Walsh- 
Fladamard with an Extended PN superimposed as scrambling code, and the 
users delay are uniformly spaced over one symbol interval. The curve la- 
beled BAID is obtained with a MMOE detector observing a single-symbol 
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period, whilst the one labeled EC-BAID is obtained with the three-symbol 
extended detector above. The superiority with respect to the conventional 
correlation receiver is apparent, although it is also apparent that when the 
number of channels gets close to the spreading factor even the IM detectors 
cannot counteract MAI any longer. 

The curves in Figure 2-22 also help to explain how the IM detectors can 
be seen as a technological factor for increasing the network capacity in terms 
of number of served users per cell. Assume that we place a QoS constraint in 
terms of BER of the link, 10 2 just to be specific. The curve of the correla- 
tion receiver in Figure 2-22 says that the maximum number of users in an 
hypothetical cell with that spreading factor is restricted to about 7 (that is, 
the value on the abscissas corresponding to the specified BER). The corre- 
sponding figure on the EC-BAID curve at the same QoS is roughly 38, with 
more than a 5-fold capacity increase! 




Number of Users, N 



Figure 2-22. Interference-mitigating capability of the EC-BAID. 



6. A SAMPLE CDMA COMMUNICATION SYSTEM: 
SPECIFICATIONS OF THE MUSIC TESTBED 

As an example of a practical, we present hereafter the specifications of 
the CDMA system envisaged in the framework of the MUSIC project, spon- 
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sored by the ESA [MU SOI]. Table 2-2 contains the specifications of the 
CDMA signal generator, including signal format, programmability features 
and physical characteristics of analog modulator. Concerning signal format, 
we observe that the specifications indicate a DS/SS-QPSK transmission with 
real spreading (Q-RS) which, according to Table 2-1, yields the best spectral 
efficiency. Two different options are indicated for the set of spreading signa- 
tures: i) a composite code set made of the orthogonal WH sequences over- 
laid by an extended PN for scrambling and cell/beam identification purposes, 
and ii) an extended version of the quasi-orthogonal Gold set, without over- 
laying. In addition the specifications envisage a variable spreading factor M 
in order to make the generator capable of supporting multi-rate transmis- 
sions. Since the modulation scheme is QPSK, the chip rate R c of the useful 
signal is given by 



R c =M R s =nLR s =nL^-, (2.171) 

where L is the signature code period, n represents the number of code peri- 
ods within one symbol interval, R s and R h are the symbol rate and the bit 
rate, respectively. The permitted values of R h , L , and R c , evaluated accord- 
ing to (2.171), are shown in Table 2-3 (maximum chip rate R c max =2.048 
Mchip/s). Full programmability is supported as far as data rates, multiple ac- 
cess interference and additive noise are concerned. The aggregate CDMA 
signal, made of the useful signal and MAI, produced by the MUSIC genera- 
tor is described by the model in (2.109). Eventually the MUSIC generator 
outputs an analog signal centered around the standard Intermediate Fre- 
quency (IF) f F = 70 MHz . The bandwidth occupancy is B IF = (1 + a )R C , 
where a = 0.22 is the roll off factor of the SRRC chip pulse shaping filters 
and its maximum value turns out to be then B n = (1 + a)R c max = 2.56 MHz. 

The receiver specifications are listed in Table 2-4. The CDMA signal 
format specifications are the very same as in the transmitter case. In addi- 
tion, some specifications are reported for the step size of the adaptive detec- 
tor and for the synchronization, namely the spreading code acquisition time 
and the Mean Time to Lose Lock (MTLL). The operating conditions, are ex- 
pressed in terms of Signal to Noise Ratio (SNR) and/or Bit Error Rate 
(BER). The specifications also report the overall maximum SNR degradation 
allowed for the modem (implementation loss) owed to finite-precision 
arithmetic digital processing and imperfect receiver synchronization. Finally, 
the receiver specifications indicate two output formats: i) binary hard de- 
tected Non-Return to Zero (NRZ) data, and ii) soft output samples repre- 
sented by 4 bits. 
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Table 2-2. Specifications of the MUSIC CDMA signal generator 



Feature 


Symbol 


Specifications 


Min. / max. channel 
data rate 


h 


4/ 128 kb/s 


Chip rate 

Signature sequence 
type 


R c 


Programmable 0.128, 0.256, 0.512, 1.024, 2.048 Mchip/s 
accuracy: 1 Hz 

Option I: WH (traffic code) + E-PN (overlay) 

Option II: E-Gold 


Signature sequence 
period 

Modulation/ spreading 
technique 


L 


32, 64, 128 chips 

QPSK / Balanced DS/SS with real spreading 
(i.e., single-code) 


Spreading factor 


M 


n ■ L , n = 1, ... , 16 


Max. no. of 
interfering channels 


N 


L 


Chip shaping 

Random data 
generator stream 


g T (0 


Square root raised cosine with roll off 0.22 

Programmable on each CDMA channel with disable 
capability 


Interferers’ delay with 
respect to the useful 
channel 


T i 


Programmable on each CDMA channel in the 
range 0 L chip intervals, resolution 0. 1 chip 
interval 


Interferers’ phase 
offset with respect to 
the useful channel 


s, 


Programmable on each CDMA channel in the 
range 0 360 degrees, resolution 1 degree 


Interferers’ carrier 
frequency offset with 
respect to the useful 
channel 




Programmable on each CDMA channel in the 
range ±70 kHz, resolution 1 Hz 


Useful channel to sin- 
gle interferer 
power ratio 


Cl I 


Programmable on each CDMA channel in the 
range +10 dB , resolution 0.1 dB 


IF carrier frequency 
Max. carrier 
frequency uncertainty 
on the useful channel 
Output signal level 


/if 


70 MHz 
+100 Hz 

Programmable in the range — 30 -s — 10 dBm , step 5 dBm 


SNR 


EJN, 


Programmable in the range -3 -f 30 dB , step 1 dB 
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Table 2-3. Bit and chip rates of the MUSIC CDMA signal format 



Rb [kb/s] 


n 


Rc [kchip/s] 
@1 = 32 


Rc [kchip/s] 
@1 = 64 


Rc [kchip/s] 
@1= 128 


4 


i 


- 


128 


256 




2 


128 


256 


512 




4 


256 


512 


1024 




8 


512 


1024 


2048 




16 


1024 


2048 


- 


8 


1 


128s 


256 


512 




2 


256 


512 


1024 




4 


512 


1024 


2048 




8 


1024 


2048 


- 




16 


2048 


- 


- 


16 


1 


256 


512 


1024 




2 


512 


1024 


2048 




4 


1024 


2048 


- 




8 


2048 


- 


- 




16 


- 


- 


- 


32 


1 


512 


1024 


2048 




2 


1024 


2048 


- 




4 

Q 


2048 


- 


- 




O 

16 


- 


- 


- 


64 


1 


1024 


2048 


- 




2 

4 


2048 


- 


- 




8 

16 


- 


- 


- 


128 


1 

2 

A 


2048 


- 


- 




w 

8 

16 


- 


- 


- 
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Table 2-4. Specifications of the MUSIC CDMA receiver 



Feature 


Symbol 


Specifications 


IF carrier frequency 


/if 


70 MHz 


Max. carrier frequency uncertainty 




±100 Hz 


Input signal dynamics 




-40 -=--10 dBm 


Max. power unbalance between 
traffic channels 




±6 dB 


Min. input SNR 


V*o 


-1 dB 


Min. / max. channel data rate 


R h 


4/ 128 kb/s 


Chip rate 


R c 


Programmable 0.128, 0.256, 0.512, 1.024, 
2.048 Mchip/s with accuracy: 1 Hz 


Signature sequence type 




Option I: WH (traffic code) + E-PN (over- 
lay) - Option II: E-Gold 


Signature sequence period 


L 


32, 64, 128 chips 


Modulation/spreading technique 




QPSK / Balanced DS/SS with real spread- 
ing (i.e., single-code) 


Spreading factor 


M 


n ■ L, n = 1, ... , 16 


Max no. of interfering channels 


N 


L 


Chip shaping 


g T (t) 


Square root raised cosine with roll off 0.22 


Adaptive detector step size 


Ybaid 


Programmable / adaptive with signal ampli- 
tude 


Min. SNR for acquisition and track- 


E C 


-24 dB 


ing @ E s / N 0 = 0 dB 






Mean acquisition time @ 


T 


< 4 sec 


[V^oLn a «d E s / N q = 0 dB 


acq 




Acq. time with 99% @ 
[£ c /(A 0 +/ 0 )] mm and 
E s / N 0 = 0 dB 


T acq 


< 8 sec 


MTLL for code/phase tracking @ 
BER= 8 ■ 10 2 


T ll 


> 3 • 10 3 sec 


Overall SNR degradation on 
AWGN w.r.t. theory @ 

10" 3 <BER < 8 • 10 2 




< 0.5 dB 


Overall SNR degradation on 
AWGN w.r.t. floating point simula- 
tion, with ideal sync./EC-BAID 
configuration @ 

10" 3 < BER < 8 • 10 2 




< 1 dB 


Baseband data output 




Binary hard detected NRZ and 4 bit soft 
output, with NRZ clock signal 




Chapter 3 

DESIGN OF AN ALL DIGITAL CDMA 
RECEIVER 



After introducing the fundamentals of spread spectrum signaling and 
CDMA we are now ready to delve deeply into the details of the system level 
design of a DSP-based CDMA receiver. We will follow a bottom up 
approach, starting from the multirate signal processing to be carried out in 
the front end section of the demodulator, down to the specific subtleties of 
the synchronization and signal detection algorithms. This will result in an 
overall receiver architecture whose description and simulated performance 
will be discussed in detail. 



1. CDMA RECEIVER FRONT END 

This Chapter contains a description of the main basic building blocks of a 
DSP-based, multirate digital CDMA receiver. Starting from the architecture 
of the digital downconversion stage in the MUSIC receiver’s Front End 
(FE), it develops through a description of the signal interpolator to be used at 
the output of the Chip Matched Filter (CMF), and describes the detailed 
design of several sub-systems of an all digital CDMA receiver, with 
particular emphasis on the multi-rate CDMA demodulator and to the 
interference mitigation functionality. 

1.1 Multi-Rate CDMA Signal 

As already detailed in the previous Chapter, the signal at the output of the 
MUSIC signal generator is centered around Intermediate Frequency (IF) 
/ IF = 70 MHz and has a bandwidth occupancy B w = (1 + a)R c , where 
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a = 0.22 is the roll off factor of the chip pulse shaping filter and R c is the 
chip rate. The latter ranges from R c min =128 to R c max = 2048 kchip/s , so 
that he maximum signal bandwidth occupancy turns out to be 
S|Fmax = (1 + d)R cmwi =2.56 MHz. A sketch of the bandwidth occupancy of 
the signal at the output of the signal generator is shown in Figure 3-1. Here, 
as well as in the subsequent drawings, the spectral components of the signal 
are represented as asymmetric with respect to / 1F . This is for illustrative 
purposes only, and is not intended to be a faithful illustration of the actual 
spectrum. The asymmetric spectral shape is helpful in identifying the 
positive and negative frequency components of the received IF signal 
spectrum during the baseband conversion process described in Section 1.3. 




1.2 Receiver Overall Architecture 

The top level schematic of the CDMA receiver is shown in Figure 3-2. 
For ease of discussion, the different functions performed by the receiver can 
be partitioned as follows: 

a) Digital Downconversion Unit (DDU); 

b) Multirate Front End Unit (MRFEU); 

c) Linear Interpolation Unit (LIU); 

d) Code Timing Acquisition Unit (CTAU); 

e) Chip Clock Tracking Unit (CCTU); 

f) Automatic carrier Frequency Control Unit (AFCU); 

g) Signal Amplitude Control Unit (SACU); 

h) EC-BAID Unit, embedding Carrier Phase Recovery Unit (CPRU). 

In addition the receiver also encompasses the Analog Signal 
Conditioning Unit (ASCU) which performs band pass limiting and 
amplitude control of the IF received signal prior to Analog to Digital 
Conversion (ADC), and a digital processing unit devoted to SNIR (Signal to 
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Noise plus Interference Ratio) estimation, denoted as SNIR Estimation Unit 
(SEU). However, we remark that the design issues related to the latter two 
additional units fall outside the scope of this book and therefore will not be 
addressed here. The interested reader can find more details about signal 
conditioning in reference [MUS01] and about SNIR estimation in references 
[Gilch], [Div98], [MUS01]. 

1.3 From Analog IF to Digital Baseband 

Concerning ADC, two different approaches are possible: asynchronous 
and synchronous signal sampling. 

In the first architecture the received signal is passed through an Anti- 
Alias Filter (AAF) and is then fed to the ADC. The latter is controlled by a 
free running oscillator, having no reference whatsoever with the clock of the 
data signal. The ADC sampling rate 1/7) is usually higher than the symbol 
rate by an over sampling factor greater than 2. Timing correction is achieved 
by interpolating the samples available at the matched filter output according 
to the estimates of the timing error. Briefly speaking, the interpolator ‘re- 
synthesizes’ those signal samples at the correct timing instants, that are in 
general not present in the digitized stream. More details on the interpolator 
sub-unit are reported in Section 2.2. 

An alternative architecture involves synchronous signal sampling, and is 
particularly useful with high data rate modems where oversampling is too 
expensive or unfeasible altogether. Here, timing correction is accomplished 
through a feedback loop wherein a Timing Error Detector (TED) drives a 
Numerically Controlled Oscillator (NCO) that adaptively adjusts the clock of 
the ADC, so that the digitized samples are synchronous with the data clock. 

In the MUSIC receiver, the ADC rate is not critical, and the decision was 
to resort to is asynchronous sampling. The received signal undergoes IF 
filtering and is passed to the ADC wherein it is sampled at rate f . The value 
of f is selected taking into account the following requirements: 

i) f > 4 R max = 4 • 2.048 Mchip/s = 8.192 MHz (to yield at least four 
samples per chip); 

it) f = 2" (to select from standard commercial quartz clocks); 

Hi) kf > 2 f ¥ + B lf , ( k - 1 )f < 2/jp - B w (to ensure that the spectral 
replicas arising from ADC do not overlap). 

According to i) and 0), we set n = 14 and f =16.384 MHz. Such value 
of f s was also found to meet condition Hi) for an IF bandwidth 
B l¥ = 2.56 MHz , with k = 9 . 




Figure 3-2. Top level schematic of the MUSIC receiver. 
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The spectrum of the resulting sampled signal is shown in Figure 3-3, 
where the replicas of the positive and negative frequency signal spectrum are 
centered around 

ft ] = 70-kf s , (3.2.a) 

ft } =-70 + k f, (3.2.b) 

respectively, with k an integer. 




Figure 3-3. Spectrum of sampled IF signal after ADC. 

Figure 3-4 zooms on the low frequency part of the spectrum, containing 



the following spectral replicas 




/ 4 (+) =70-4 f s = 4.464 MHz, 


(3. 3. a) 


/ 5 (+) = 70 - 5 f s = -1 1 .920 MHz, 


(3.3. b) 


/ 4 (_) = -70 + 4 /; = -4.464 MHz, 


(3.3.C) 


/H = -70 + 5 /; = 1 1 .920 MHz. 


(3.3 .d) 


The sampled signal is then digitally I/Q downconverted to baseband by a 
Digitally Controlled Oscillator (DCO) operating at the Digital IF (1FD) 
/ifd = 4.464 MHz , as is sketched in Figure 3-5. The downconverted signal 
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contains unwanted image spectra located at the frequencies ± /' and ± f" as 
follows 



ft' - /ifd= 7-456 MHz = /', 


(3 -4. a) 


/ 4 (+) +/,fd =8-928 MHz = f", 


(3-4. b) 


/ 5 (+) +/ifd =-7-456 MHz = -/', 


(3.4.C) 


- fwn = -8.928 MHz = -/" . 


(3.4.d) 


The unwanted image spectra located at ± /' 


and ± f" will be rejected by 


means of the (low pass) CMF. By normalizing with respect to the sampling 
frequency f s , we also get the normalized locations of the spectral images to 


be rejected 




W 7 456 s 0.455, 
f, 16.384 


(3. 5. a) 


r = 8 928 s 0.545. 
f s 16.384 


(3.5.b) 


The basic and conceptual schematic of the I/Q downconverter is shown 
in Figure 3-6, where the blocks labeled ‘CIC’ are in charge of decimation, as 



detailed in Section 1.4. Figure 3-7 shows the implementation of the scheme 
in Figure 3-6. (Digital) i/Q downconversion is implemented by resorting to a 
Direct Digital Synthesizer (DDS) that drives a couple of real multipliers. 
Particularly, the DDS is made up by a phase accumulator and a Look Up 
Table (LUT) storing the sine and cosine samples. 

The phase accumulator performs numerical integration of the digital 
reference at frequency / IFD , whose value is represented by the Frequency 
Control Word (FCW) on « FCW bits, according to the following recursive 
equation computed at the clock rate f s 



4b ~ 4h-l + fwD 5 



(3.6) 
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where T S =V f s is the sampling interval. The phase accumulator, which is 
assumed to operate internally on n acc bits, outputs a (sampled) ramp, 
represented by n ha bits (with n pha < n acc ), whose slope is determined by the 
digital FCW / IFD . The n phc most significant bits of the accumulator are then 
used to represent the phase § k that is passed to a LUT which performs 
sine/cosine generation and outputs the sequences sin(tj) t ) and coslij), ) , 
represented by n DC0 bits. To do this the word representing the phase (|) /: on 
n pha bits is used to address a ROM table containing N Lm samples excerpted 
from a period of a cosinusoid, quantized by 2" DC0 levels. 




Figure 3-4. Particular of Figure 3-3. 





Figure 3-5. Digital downconversion to baseband. 
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Figure 3-6. Basic block diagram of the I/Q digital downconversion to baseband. 




Figure 3-7. Implementation of the I/Q digital downconversion to baseband. 

Let us focus on the issue of quantization of tj),. , sin((|) ( ) and cos(<j), ; ) , and 
let us determine the number of required bits. First of all, assume for the 
moment that the sine and cosine values have so many bits that the 
quantization of § k is significant only. If the phase is quantized by n phu bits 
the unit circle is divided into 2 V “ phased increments, and the quantization 
noise is uniformly distributed, so that the resulting RMS phase noise, 
measured in fractions of a cycle, is 



a, 



2 « pha 

Vl2 ' 



(3.7) 
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For instance, assuming n pha = 8 bits, we have = 1 . 1 x 1 0 3 cycles , 
corresponding to an RMS value of 0.41 degrees, which seems adequately 
small for the detection of QPSK-modulated signals to bear no appreciable 
BER degradation. We remark however that accurate assessment of the actual 
BER penalty associated with n ha can be carried out only by means of bit 
true computer simulations, as is done in Section 5.2. 

Now, let us make the reverse assumption that <j), t is delivered with 
‘infinite’ precision, and that significant quantization takes only place in the 
‘output’ values of the sine and cosine functions. Denote also by q(^) the 
quantized versions of the infinite resolution value S, . In this case, if the sine 
and cosine functions are quantized by n DC0 bits the received signal will be 
rotated by a phase amount given by 



(J>^ = arctan 



q[ sin (4h)] 1 

q[cos(^)]J 



(3.8) 



and, neglecting any amplitude fluctuation resulting from phase quantization, 
the quantization error is 

A( b =<K -<h - (3-9) 



Following the analysis in [Gar88], the RMS value of this error, measured 
in fractions of a cycle, is given by 



cs. 



^ n DCO 

kV12 ' 



(3.10) 



For n DC0 = 8 bits , we have = 3.6xl(T 4 cycles, corresponding to 0.13 
degrees. From the two expressions above it turns out that the RMS 
quantization error on the sine/cosine function values is n times (i.e., about 4 
times) larger than the RMS error caused by quantization of the phase (for the 
same word length). Therefore, if the sine/cosine LUT outputs are quantized 
by the same number of bits as the phase <|), at the LUT input ( n DC0 = n pha ), 
then the effect of quantization on the phase is dominant. Alternatively, if we 
let n DC0 = n pha -2 (i.e., the sine/cosine outputs are represented by a word 
shorter by two bits than the input phase word), then the LUT input and 
outputs provide nearly equal contributions to the net quantization noise. 

In addition to the issue of quantization noise owed to I/Q and/or phase 
quantization, it is also necessary to determine the effect of finite length of a 
word representation in the DCO. The phase rotator described above acts also 
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as a frequency translator according to the following recursive algorithm, 
which is inherently implemented by the DCO 

<b = (<b-l + 2nT s flFD ) mod 271 , (3.11) 

where <\> k is the instantaneous phase used to correct the incoming signal. 
Scaling by 2n we obtain 

^ = f^ + ^/,Jmodl. (3.12) 

ZTC y ZTC j 

Denoting now by the (normalized) phase expressed in cycles 
( & k = (j) t / 271 , 0 < & k <1 ), we obtain 

9*= (»*_,+ 7) modi, (3.13) 



or 

9< = (kT s / IFD ) mod 1 , (3.14) 

which describes operation of the DCO. According to (3.13), at each sample 
time the FCW (i.e., the normalized frequency T s / IFD represented by « FCW 
bits) is added to the previous contents of the accumulator, which operates on 
n acc bits. Overflow management of the accumulator is intrinsic to the 
modulo- 1 operation, with the accumulator contents regarded as a binary 
fraction between 0 and 1 cycle. The exact value of the (normalized) 
frequency f WD T s is provided (at symbol time) by the Frequency Error 
Detector (FED). Phase resolution can be made as fine as desired by 
lengthening the accumulator word length, i.e., by increasing n acc . The Least 
Significant Bit (LSB) of the FCW ( T / IFD ) must be consistent with the LSB 
of the accumulator. If there are n acc bits in the accumulator, the phase is 
quantized into increments of 2 """ cycle. Usually only the n pha Most 
Significant Bits (MSBs) of the accumulator (with n pha < n acc ) are read out to 
the subsequent sine/cosine LUT. The smallest frequency increment is 




(3.15) 



so improved frequency resolution can be obtained by lengthening the 
accumulator (and the relevant FCW). 
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In many practical applications a typical choice for the accumulator word 
length is n acc = 32 , so as to provide a phase resolution 



Ad = 2~ H " = 2~ 32 =2.33xl(T 10 cycles, 


(3.16) 


or, equivalently, 




A(j> = T" acc ■ 2n = 2- 32 ■ 2n = 1.46 x 10~ 9 rad, 


(3.17.a) 


A(|> = 2 x 360 = 2~ 32 x 360 = 8.38 xl(T 8 deg. 


(3.17.b) 


Concerning the frequency resolution we have 




Af = f* A6.384MH, _, Hz 

y 2"““ 2 32 


(3.18) 


and 




^ = 1 = 2.33 xl(T 10 , 

f s 2' w 


(3.19) 


which is unnecessarily accurate for our purpose. By relaxing the 
requirements about the resolution, we may consider a smaller accumulator 
based on n acc = 8 bits, thus obtaining 


Ad = 2~' Vt = 2~ 8 = 3.91 xl(T 3 cycles 


(3.20) 


A(> = 2 -"*•* x27t = 2“ 8 x 2 ti^ 3.91 xl0^ 3 x2n^ 2.45 xl(T 2 rad, (3.21.a) 


A(|> = 2~' w x 360 = 2~ 8 x 360 = 3.91 xl(T 3 x 360 = 1.41 deg, 


(3.2 l.b) 


. r. f s 16.384MHz 
A f = = 64 kHz, 

y 2"“ c 2 8 


(3.22) 
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^- = — = 3.91xl(T 3 . (3.23) 

/, 2 "« 



The feasibility of an 8 bit accumulator will be verified later by means of 
extensive bit true computer simulations. The phase at the accumulator output 
can not have n ha = n acc because of speed limitations in the access to the 
LUT, therefore only the n ha MSBs of the accumulator (with n pha < n acc ) are 
read out to the following sine/cosine LUT. According to the results 
highlighted above, it is common practice to choose n ha = n DC0 + 2 so as to 
limit the truncation error. We assume that signal degradation is negligible if 
the digital phase rotation of the samples is carried out by using 8 bit 
sine/cosine coefficients ( n DC0 = 8 bits ). This assumption seems rather 
conservative and shall be verified, and possibly relaxed, by means of bit true 
simulations. Under this hypothesis we have n pha =10 bits, and we can 
address a number of 2""‘“ = 2 10 = 1 024 different phases. Considering the 
symmetries of the sine and cosine functions, we can reduce the number of 
elements stored in the LUT. Actually, it is sufficient to store in the LUT only 
the samples (represented by 8 bits) of the sine in the interval O-ti/ 2 
(corresponding to 1/4 of a cycle). Recalling that the phase resolution is 1024 
samples per cycle, the LUT must contain only N Lm = 1024/4 = 256 values. 
A pictorial diagram of the LUT architecture is shown in Figure 3-8. 

1.4 Decimation and Chip Matched Filtering 

The received signal is sampled at the rate f s =1/7) = 16.384 Msample/s , 
and the number of samples per chip is therefore given by 



v 



s 



A 

K 



16.384 



(3.24) 



whose possible values for the various chip rates are reported in Table 3-2. 

Notice that chip matched filtering and the subsequent digital processing 
operations (noticeably chip timing synchronization) require no more than 2 
to 4 samples per chip interval. According to (3.24) the signal turns out to be 
significantly oversampled for the lowest bit rates, and this would cause the 
CMF to bear excessive complexity owing to the huge number of taps 
required. Therefore decimation is in order so as to achieve the target 
sampling rate of n s = 4 sample/chip . The decimation factor to be applied is 

_ v, _ fs 16-384 4.096 

P n, n s R 4 R c R 



(3.25) 
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BITS 

#0 #1 #2 #3 #4 #5 #6 #7 




Figure 3-8. LUT architecture. 

A signal sampled at rate f s is decimated by a factor p by taking the first 
sample out of a group of p samples. The decimated signal has frequency 
fd = f s / P • From a spectral point of view this implies that the spectral 
replicas, which in the original signal were located on frequencies m f (with 
m an integer), are moved to mf d = m /’ / p . Unfortunately this operation 
causes aliasing of the wideband noise spectrum, thus reducing the Signal to 
Noise Ratio (SNR). Figure 3-9 shows this phenomenon for a decimation 
factor p = 2 . 

Table 3-1. Chip Rates ( R ), Samples per Chip ( ), Decimation Factors ( p ), and Decimated 



Frequencies ( /, ). 



R 




P 


f. 


0.128 Mchip/s 


128 sample/chip 


32 


0.512 Msample/s 


0.256 Mchip/s 


64 sample/chip 


16 


1.024 Msample/s 


0.512 Mchip/s 


32 sample/chip 


8 


2.048 Msample/s 


1.024 Mchip/s 


16 sample/chip 


4 


4.096 Msample/s 


2.048 Mchip/s 


8 sample/chip 


2 


8.192 Msample/s 
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A 




Figure 3-9. Aliasing of white noise spectrum. 

When decimating a signal we have to ensure by appropriate pre-filtering 
that no aliasing will take place. A low complexity solution for jointly 
performing pre-filtering and decimation is the so called Cascaded Integrator 
and Comb (C1C) architecture [Hog81], depicted in Figure 3-10 and 
described hereafter. The integer N represents the order of the CIC filter, 
while the integer M is termed differential delay. 

From Figure 3-10 the transfer function of the N stage cascaded 
integrator section, operating at sample rate f s , is 






(3.26) 



Integrator Section 



Decimation 

P 



Comb Section 



Stage 1 Stage 2 Stage N ] Stage N+ 1 Stage N+2 Stage 2N 




fs >j« fd 



Figure 3-10. CIC decimation filter. 
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After the integrator section the sample stream is decimated by a factor p , 
down to rate f d = 1 !T d given by 



f d = 



L 

p 



16.384 

P 



16.384 

4.096 



i? =4 7? 



(3.27) 



The numerical values of the decimated rate f d envisaged in the 
framework of the MUSIC project are reported in the last column of Table 3- 
1. Finally, the decimated samples are filtered by the N stage cascaded comb 
section, operating at rate f d , and whose transfer function is 

H' c (z) = (\-z- M ) N . (3.28) 



The transfer function of the comb section can be re-written with respect 
to the input sample rate f s as 

H c (z) = (l-z- pM ) N . (3.29) 



The resulting overall transfer function, expressed with respect to the 
input sample rate f , is then 



hu)-- 






1 — z 



pM-1 \ N 

IX • 

k=0 J 



(3.30) 



The frequency response of the CIC filter is therefore 

-jii/r,(pM-i)jv | sin(jt/T s pM) 



*(/) = 



s ' n (f/7’ s ) 



(3.31) 



where f s =l/T s . The amplitude response is 



H{f) = 



sin 



npM 



s J 



sin 



n 



fs 



V Js J 



sine 



M p- 



p M 



S J 



sine 



f 

\fs J 



(3.32) 



or equivalently, recalling that f d = fj p , 
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H{f) = 



sin 



71 M 



f 

f 



d J 



sin 



v P fd j 



sine 



M 



M p- 



V Jd J 



L 

fd 



sine 



1_ 

R fa 



d J 



(3.33) 



We observe now that 
H(0) = (Mp) N , 

and therefore the normalized frequency response is 

1 sin (nfT s pM) 



n( -]nfT s (pM-\)N 



Mp sin(7i/T s ) 



(3.34) 



(3.35) 



so G(0) = 1 . From the equations above it is apparent that the CIC filter is 
equivalent to a cascade of N ‘moving average’ Finite Impulse Response 
(FIR) stages, each operating at the input sample rate f s and having transfer 
function 



pM-l 

s( z )= 



k=0 



(3.36) 



followed by a decimator which reduces the sample rate by a factor p, as 
shown in Figure 3-11, where the transfer functions H{z) and S(z) are also 
indicated. 

Implementation of the CIC in this second form would be much more 
complex than the original one, and this motivates the introduction of such a 
circuit. In analytical computations, it is sometimes more convenient to refer 
to the latter equivalent moving average model. 

The baseband bandwidth occupancy of the useful signal is now 



B 



BB 




0 + oR 

2 



= 0.61 , 



(3.37) 



and the baseband occupancy normalized with respect to the decimated 
frequency f d is given by 
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p = 1*22. = °- 61 K _ 0 _ 15 25 (3.38) 

f d 4i? e 

which does not depend on the chip rate. Figure 3-12 shows the generic 
frequency response &'(/) , as compared with the various (wanted and 
unwanted) spectral components of the received signal. 




fi | fd 

I 



Decimation 

P 



Figure 3-11. Equivalent model for the CIC decimation filter. 

It is apparent that the amplitude response G(f ) of the CIC filter is not 
flat within the useful signal bandwidth, and therefore some compensation, by 
means of a subsequent equalizer, is required in order to minimize signal 
distortion. We also see that the particular value of the decimation factor p 
determines the location of the frequency response’s nulls at the frequencies 
mf d = m f / p . Such nulls reveal crucial for rejecting those spectral 
components that, owing to the decimation, are moved into the useful signal 
baseband. The differential delay M causes the appearance of intermediate 
nulls in between two adjacent nulls at mf d . These additional nulls are of 
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little utility and do not significantly increase the alias rejection capability of 
the CIC filter. This feature is highlighted in Figure 3-12, where the case 
M =1 (dashed line) is compared with the case M =2 (solid thick line). 
Actually, an increase of M does not yield any improvement in the rejection 
of the unwanted spectral components, while it requires an increase in the 
storage capability of the CIC filter. Therefore according to [FIog81] and 
[FIar97] we will restrict our attention in the sequel to the case M =1. 




Figure 3-12. Generic normalized frequency response of the CIC decimation filter. 

The order N of the CIC filter determines the sharpness of the notches at 
mf d and the amplitude of the relevant sidelobes, therefore it must be 
carefully selected, taking into account the required attenuation of the 
unwanted spectral components. Assuming that a white noise process is 
superimposed on the signal at the CIC filter input, the shape of the frequency 
response G(f) is proportional to the amplitude spectral density (i.e., the 
square root of the power spectral density) of the noise process at the output 
of the CIC, prior to decimation. Decimation causes the (normalized) 
amplitude spectral density G(f) to be translated onto mf d = m fj p . As a 
consequence the useful signal spectrum will suffer from aliasing caused by 
the lobes of the spectral replicas, as clarified in Figure 3-13. 

The total contribution of the aliasing spectral replicas, that we call alias 
profile [Har97], is made of the contribution of p terms, and is bounded from 
above by the function 

p/ 2 

A (f)= I Hf-v d )\ 

k=~ P/2 
k*0 



(3.39) 
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The parameter N therefore keeps the alias profile A(f) as low as 
possible within the useful signal’s bandwidth B bb . 

Figure 3-14 shows the frequency response G(f) for the different 
decimation ratios p in Table 3.2, for M = 1, and 7V = 4, while Figure 3-15 
reports G(f) for different orders of the filter N , forM=l, and p = 8. In 
both the figures G(f ) is plotted versus the normalized frequency / / f s . 




Figure 3-13. Aliasing effect of the CIC filter caused by decimation. 

As already mentioned, the spectrum of the signal at the output of the CIC 
filter, at the decimated rate f d , suffers from amplitude distortion, owing to 
the non-constant frequency response 11(f ) (or, equivalently, G(f ) ). This 
calls for the use of a compensation filter (also termed equalizer) having a 
frequency response H aj (f) given by 

»„(/)= 

such that 

K(f) H (f) l=i- ( 3 - 41 ) 

We will consider the compensation filter G eq (f ) for the normalized 
frequency response G(f ) , that is 



sin(7i//p/ rf ) 
sin (nMf/fi,) 



(3.40) 
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G„(f) 



sin ji 



M p- 



Pfa 



sm 



nM 



d J 

L) 

L 



d J 



such that 

M/)g(/)|=i, 



(3.42) 



(3.43) 



with G cq (0) = 1 . The ideal (3.42) has Infinite Inpulse Response (HR), and is 
approximated in our implementation as an FIR filter with N taps and 
coefficients g ( k eq) A where k = 0,...,( N eq -1). The actual frequency response 
of the equalizer G eq {f) is 

<M/) = (3.44) 

k = 0 



where T d represents the sampling interval after decimation. Owing to the 
truncation of the impulse response we only have G cq (/) = G eq (/) , and 




NORMALIZED FREQUENCY f/f s 



Figure 3-14. Frequency response of the CIC filter, M = \,N=A. 
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NORMALIZED FREQUENCY f/f s 



Figure 3-15. Frequency response of the CIC filter, M= 1, p = 8. 



M°)=I>r*l- (3-45) 

k = 0 

Therefore we consider a re-normalized compensation filter G%(/), 
defined as 



G' eq {f) = 



Geq (/) 
Geq ( 0 ) 



(3.46) 



such that G'eci (0) = 1 . The compensation filter G,,, (/ ) can be synthesized 
according to the technique described in [Sam88], where a suitably modified 
version of the Parks-McClellan algorithm [McC73] for the design of equi- 
ripple FIR filters is used. The algorithm inputs are the length N of the FIR 
impulse response and the bandwidth of = E{«^(m)} = N 0 /T c with 
maximum flatness, after equalization. After some preliminary tries we set 
JV =17, in order to reduce the complexity of implementation, and 
B f = 0.35 f d , so as to minimize amplitude distortion (in band ripple) on the 
signal bandwidth B m =0.1 525 f d . As is apparent from the definition of 
H eq (/) and its related expressions, the frequency response of the equalizer 
depends on the decimation factor p. As a consequence the set of the 
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coefficients of the compensation FIR filter must be computed and stored for 
every value of p , and the filter must be initialized by loading the coefficients 
g[ eq) every time p is changed. 

Figures 3-16 and 3-17 show the frequency response of the CIC compared 
with the alias profile, either uncompensated (dashed curves) or with 
compensation (solid curves), obtained for p = 32 , with N = 4, M = 1 . The 
effectiveness of the equalizer in flattening the frequency response up to 
0.35 f d is apparent. Also, with the parameters specified above, alias 
suppression within the useful bandwidth (B m =0.1525 f d ) turns out to be 
higher than 45 dB. 




0.00 0 05 0 10 0 15 0.20 0 25 0.30 0 35 0 40 0 45 0 50 0.55 0.60 



NORMALIZED FREQUENCY f/f d 



Figure 3-16. Frequency response of the CIC filter and alias profile, 
with (solid) and without (dashed) compensation, M = 1, N = 4, p = 32. 



After equalization, filtering matched to the chip pulse takes place. The 
CMF is implemented with an FIR filter with N CM¥ taps, approximating the 
ideal Nyquist ’s Square Root Raised Cosine (SRRC) frequency response 



G,(/) = 




(3.47) 



where G N (f) is the Nyquist’ s Raised Cosine (RC) pulse spectrum with 
G n (0) = T , and roll off factor a = 0.22 . Preliminary investigation about 
truncation effects in the CMF, carried out by computer simulation, 
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demonstrated that the performance degradation is negligible if the SRRC 
impulse response is truncated (rectangular window) to L = 8 chip intervals. 
The overall length of the CMF impulse response must be at least 

N CMF =L-n s +1 = 8- 4 + 1 = 33 samples . (3.48) 

Considering the symmetry of FIR impulse response, the number of filter 
coefficients to be stored is 




0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 



NORMALIZED FREQUENCY f/f d 

Figure 3-1 7. Frequency response of the CIC filter ( dashed line), compensation filter (solid 
thick), and overall compensated response (solid thin), M = 1, N = 4, p = 32. 

Integration of the compensation filter and the CMF into a single FIR 
filter was also considered. 

However, the design of a single equivalent filter revealed quite a critical 
task. In particular, the resulting filter exhibited intolerable distortion on the 
slope of the frequency response. The consequence was that the two filters 
were implemented separately. 

The resulting architecture of the front end of the MUSIC receiver is 
shown in Figure 3-18. 
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Figure 3-18. Architecture of the MUSIC receiver with the Multi-Rate Font-End. 



2. CDMA RECEIVER SYNCHRONIZATION 

This Section tackles the issue of synchronization in a CDMA receiver, 
starting from a few general concepts, down to the particular design solutions 
adopted and implemented in the MUSIC receiver. 

2.1 Timing Synchronization 

During start up, and before chip timing tracking is started, the receiver 
has to decide whether the intended user m is transmitting, and, in the case 
he/she actually is, coarsely estimate the signal delay r m to initiate fine chip 
time tracking and data detection. 

2.1.1 Code Timing Acquisition 

Consider now the issue of code timing acquisition. In most cases this task 
is carried out by processing the so called pilot signal. This is a common 
CDMA channel in the forward link or a dedicated CDMA channel in the 
uplink, that is transmitted time and phase synchronous with the useful traffic 
signal(s), and whose data modulation is either absent or known a priori. 
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The pilot signature code sometimes belongs to the same orthogonal set 
(i.e., the Walsh— Hadamard set) as those used for the traffic channels. In this 
case, it is common practice to select as the signature of the pilot signal the 
‘all 1 ’ sequence, i.e., the first row of the Walsh-Hadamard matrix. 

However, in some cases it may be expedient to use a signature belonging 
to a different set (hence non-orthogonal) in order to avoid false locks owed 
to high off sync cross-correlation values of the WH sequences. 

This issue will be addressed later when dealing with numerical results. 
Also the pilot signal is usually transmitted with a power level significantly 
higher than the traffic channel(s) (the so called pilot power margin or PIC 
ratio) to further ease acquisition and tracking. 

As is discussed in [Syn98], conventional serial acquisition circuits are 
remarkably simple, but entail a time consuming process, leading to an a 
priori unknown acquisition time. 

Therefore we have stuck to the parallel acquisition circuit for QPSK 
whose scheme is depicted in Figure 3-19. The design parameters of such a 
circuit are the value of the normalized threshold A, and the length W of the 
post-correlation smoothing window. We shall not discuss here the impact of 
such parameters on acquisition performance, since this issue is well known 
from ordinary detection theory. 

Implementation of the CTAU directly follows the general scheme in 
Figure 3-19, and is summarized in Figure 3-20 [De98d], [De98e], The 
CTAU receives the stream of complex-valued samples at rate 2 R c (two 
samples per chip) at the output of the LIU. 

Such an 1/Q signal is processed by a couple of filters matched to the 
spreading code (this operation is also addressed to as the sliding correlation 
of the received signal with the local replica code). Notice that in Figure 3-20 
the front end features two correlators because modulation is QPSK with real 
spreading (i.e., it uses a single code). Also the circuit in Figure 3-19 assumes 
a correlation length (the impulse response length of the front end FIR filters) 
equal to one symbol span, just as in the conventional despreader for data 
detection. 

On the other hand, if we assume an unmodulated pilot there is no need in 
principle to limit the correlation length to one symbol (as, in contrast, is 
needed when data modulation is present). We have thus a further design 
parameter represented by the length of the correlation window. 

For convenience we will investigate configurations encompassing a 
correlation time equal to an integer number, say M, of symbol periods 
(coherent correlation length). 

The correlator outputs, again at the rate 2 R c , are subsequently squared 
and combined so as to remove carrier phase errors. Parallelization takes 
place on the signal at the output of the combiner, still running at twice the 
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chip rate. By parallelizing we obtain a 2 L -dimensional vector signal running 
at symbol time, whose components thus represent the (squared) correlations 
of the received signal with the locally generated sync reference signature 
code, for all of the possible 1 /2-chip relative shifts of the start epoch of the 
latter. 




b) 




x 



L-l 



Figure 3-19. Parallel Code Acquisition Circuit. 

After (parallel) smoothing on the observation window of length W 
symbols we obtain the sufficient statistics to perform signal recognition and 
ML estimation of the signature code initial phase. In particular, the 
maximum among all of the components is assumed to be the one bearing the 
‘correct’ code phase. The CTAU broadcasts such information (denoted to as 
code phase ) to all of the signature code generators that are implemented in 
the receiver (EC-BAID, CCTU, SACU etc.) either for traffic or for sync 
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reference code generation. As is seen, this acquisition device also features an 
adaptive estimator of the noise plus interference level that is used to detect 
presence of the intended sync signal. The circuit also provides an 
information bit which indicates the presence, or the absence, of the pilot 
signal. 




Figure 3-20. Block diagram of the CTAU. 

In our design we set the CTAU parameters so as to obtain: 

i) probability of False (signal) Detection ( P H) ) lower than 0.001; 

ii) probability of Missed (signal) Detection (P UD ) lower than 0.001; 

in) probability of Wrong (code phase) Acquisition ( P WA ) lower than 0.001. 

Such probabilities are sufficiently low so as to enable one to disregard the 
influence on system performance of ‘bad’ events (i.e., acquisition takes place 
with approximately unit probability, and always takes place on the correct 
code phase). Considering the post-correlation smoothing period and the 
coherent correlation time, the total acquisition time is 

W-M -L 

T acq = W-M-T s =W-M-L-T c =— . (3.50) 

K c 

The worst case corresponds to the lowest chip rate R c = 0.128 Mchip/s, 
so that the acquisition time is bounded from above by 
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T WML 

T < us 

acq 0.128 



(3.51) 



Recalling the requirement of the average acquisition time T = 4 s in the 
project specifications, we have 



W ■ M ■ I 

ps< 4xl0 6 ps , (3.52) 

0.128 



which gives 

W -M-Z<512000. 



(3.53) 



Table 3-2 reports the upper bounds of the product W M , referred to as 
latency, for the different code lengths. 



Table 3-2. Upper bounds of the product WM (latency) for the CTAU, 



L 


(W-M) 

x 7 max 


32 


16000 


64 


8000 


128 


4000 



2.1.2 Chip Timing Tracking 

Once signal detection and coarse code timing acquisition have been 
successfully completed, chip timing tracking is started. 

The unit in charge of fine chip time recovery is the CCTU, and is based 
on a non-coherent non-data aided closed loop tracker that closely follows the 
architecture outlined in [DeG93], In this respect Figure 3-21 shows the 
integrated CCTU/L1U. 

As apparent from the figure the outputs of both I and Q interpolators, 
running at the rate 2 R c , are demultiplexed in two low rate ( R c ) signals by 
two demultiplexers. The first signal is obtained collecting those samples 
taken (interpolated) at the optimum sampling instants, and are therefore 
referred to as prompt (or on time) samples. The other stream is made of the 
samples in between two consecutive prompt samples, and are therefore 
addressed to as Early/Late (E/L) samples. The prompt samples are used by 
the EC-BAID for data detection and by the Frequency Error Detector (FED) 
for fine carrier tuning, while the E/L samples are used by the CCTU for fine 
chip clock recovery. 
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More in detail the CCTU is made of a Chip timing Error Detector (CED) 
that operates on the E/L samples and an update unit which recursively 
updates the integer delay and the fractional epoch input to the LIU. The CED 
(shown in Figure 3-22) is the traditional non-coherent E/L correlator with 
time offset equal to one chip and full symbol correlation. The update rate of 
the CCTU output parameters is thus equal to the symbol rate (one CED 
output per symbol time). In order to ease clock tracking the CCTU performs 
correlation of the received samples with a local replica of the pilot signature 
code. Just to reduce implementation complexity, the squared amplitude 
nonlinearity of traditional E/L CEDs is replaced by a simpler amplitude 
nonlinearity. The relevant performance difference was shown to be 
negligible by simulation. The CED output signal is finally scaled by an 
amplitude control signal provided by the SACU, resulting in the arrangement 
sketched in Figure 3-22. 

The CCTU is also equipped with the Lock Indicator shown in Figure 3- 
23 which signals completion of the timing lock procedure. The lock signal is 
obtained through a number of steps: first, the CED output E k is low pass 
filtered with the same bandwidth as the CCTU loop bandwidth; then the 
filter output is rectified; and finally the lock condition is tested through a 
comparator with hysteresis. The latter feature prevents possible sequences of 
repeated lock/unlock indication in a noisy environment. 



P 



AGC 



l/Q samples 

from 

Front-End 



from SACU 




to EC-BAID, FED 



Figure 3-21. CCTU/LIU Architecture. 

The initial state of the smoothing filter, as well as the comparator 
thresholds, are set according to the average value of the decision variable 
\E k | , the so called M curve, that is shown in Figure 3-24. 

The initial value of the detector status (i.e., of the smoothing filter output) 
E 0 has to be set taking into account the diverse initial sampling errors that 
may occur. 








110 



Chapter 3 




Figure 3-22. CED outline. 




Figure 3-23. Lock Indicator. 




t / T d — t / (T c / 4) 



Figure 3-24. Lock Detector Characteristics (M curve). 

The worst case is x 0 =T c /4 which corresponds to an average CED 
output equal to 0.5. If we want to signal loop lock when the timing error is 
smaller than or equal to 5% of a chip interval, the Tow’ (or inferior) 
threshold must be roughly /, inf = 0.0625 as shown in Figure 3-24 (dash-dot 
line). Also, if we want to signal loss of lock when the error is greater than 
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12.5% of a chip interval we have to set the ‘high’ (or superior ) threshold to 
%up- 0.1X75 . 

Unfortunately, setting the smoothing filter onto a positive value fails 
when the initial timing error is negative. To attain symmetry in this respect, 
it is expedient to resort to the modified lock detection structure shown in 
Figure 3-25, where the two smoothing filters are initialized at the two 
symmetric values -E 0 and E 0 , respectively. In so doing, the behavior of the 
detector will be always symmetric. 




Figure 3-25. Modified Lock Detector. 



2.2 Interpolation 

The output x{kT d ) of the I (or Q) CMF (with k running at the decimated 
sample rate f d = fj p = 4 R c ) are input to an interpolator (LIU) which 
provides the strobes for signal detection and synchronization (addressed to 
as prompt and E/L samples, see Section 2.1.2). Very accurate interpolation 
for band limited signals is in general provided by a third-order polynomial 
interpolator. In our case the digital signal bears a relatively high 
oversampling ratio (i.e., f d /R c .= 4 samples per chip interval), so that a 
simpler linear (first-order) interpolator ensures sufficient accuracy. 

In order to compensate for the drift between the free running clock of the 
receiver ADC and the actual chip clock of the received signal, each 
interpolator is controlled by an estimate of the (time varying) code timing 
delay provided by the CCTU. The signal x(kT d ) running at f d =1/7), is 
then interpolated so as to provide a decimated signal at twice the chip rate 
2R c = 2 / T : . During the generic m th symbol interval, we will have therefore 
2L sampling epochs t m „ for any interpolator such that 

t m,n+l = t m,n + ff ’ (3.54) 

where m runs at the symbol rate 1/7), n runs at twice the chip rate 
2R c = 2 / T c , with 0 < n < 2L - 2 , and where t m 0 is provided and renewed at 
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each symbol interval by the CCTU. The initial sampling epoch t m 0 in each 
symbol interval is in fact updated once per symbol interval as follows 

C,0 = mT s + , (3-55) 

where x m is the control signal provided by the CCTU according to the 
following recursive equation 

U, + i=T„ -YC,,- (3-56) 

In (3.56) e m is the error signal provided by the CED of the CCTU (see 
Figure 3-21), and y is the update step of the algorithm, the so called step 
size which in the following will be also referred as y ccxu . The stepsize y 
must be set as a reasonable trade off between acquisition time and steady 
state jitter performance. From (3.55) we obtain 

* m =tm,o- mT s’ (3-57) 



Urn =t m + i,o-{ m + l ) T s’ (3-58) 

and substituting (3.57) — (3.58) into (3.56) we obtain 

U+i.o - (m + 1 ) 7 ) = L, o " mT s - l e m (3-59) 



and 



t m + l,0= t m,0 +T s-Y e m- (3-60) 

The sequence t m 0 needs, however, further processing in order to produce 
a control signal for the interpolator. From (3.69) it is seen that even if the 
update term ye m takes on small values (say, fractions of a symbol interval 
T s ), the value of t m 0 increases unboundedly with m. To cope with this issue 
we decompose the sampling epochs t m 0 as follows 

C.o =( / m + F„,)7j, (3.61) 



where 
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(3.62) 



is the integer part of t m 0 as measured in clock ticks of the sampling 
frequency f d , and 



P, 




(3.63) 



is the fractional part of t m 0 , expressed again in sampling clock intervals. The 
sampling control signal x m provided by the CCTU is updated every symbol 
interval, and so will also be the two values of /,„ and p m . By substituting 
(3.61) into (3.60) we obtain 

(C+i + P* + i )T d = (/„, + p m )T d + T s - je m , (3.64) 



T y 

(Cl + P„ + 1 ) = (L + P,„ ) + -=r - e m = (l m + P,„ ) + T1 - fe m (3.65) 

‘ d 

where y' = y lT d and r\ = TJT d . Taking the integer part of both sides in 
(3.65) we obtain 

L + 1 = L + T1 + int {p,„ - fe m } , (3.66) 

where we have assumed that the oversampling ratio q is an integer value. 
By taking the fractional part of (3.65) we obtain instead 

b,„+i =frac{p„, - y’e m } . (3.67) 



Equation (3.66) can also be cast into the form 

L + 1 - L =4 + int {p,„ ~fe n } , (3.68) 

whose right hand side term represents the number of input samples to be 
shifted into the interpolator until the next output is computed. 

Once l m and \i m are computed, the output of a third-order interpolator is 
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) = X c < (X ) • 



{L~i)T d +n y 



0< n <2L - l . 



(3.69) 



The meaning of f m _„, and p m for the third-order interpolator is 
illustrated in Figure 3-26. 



x[(lm-l)Td+nTc/2] x[(lm+ 1 )T d+nT c/2] x[(lm+2)T d+nT c/2] 




( lm- I )Td+nT c/2 lmTd+nT c/2 tm,n ( I m+ 1)1 d+nT c/2 (lm+2)T d+nT c/2 



M 

pmTd 



Figure 3-26. Meaning of the 3rd-order interpolator parameters. 

The coefficients C, (p) ( -2 < / < 1 ) are given by 

C 2 (p) = ^-H (3.70) 

6 6 



C,(p) 




(3.71) 



CoM 



0 2b, 

— +u - — + 1. 

2 2 



(3.72) 



C 1 (p) = -i l - + ^--^, (3.73) 

v ; 6 2 3 

and the block diagram of the interpolator is depicted in Figure 3-27. It is 
seen that the implementation is that of an FIR filter with variable 
coefficients. 
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Clock for the update of the interpolator's epoch 
(twice the chip rate) 



Figure 3-27. Architecture of the 3rd-order interpolator. 



x(lmTd+nTc/2) 



y(tm,n) 



x[(lm+l )T d+nT c/2] 



1 1 1 >■ 

lmTd+nTc/2 tm.n (lm+ 1 )T d+nT c/2 

I M 

JdmTd 



Figure 3-28. Linear interpolation. 

A simpler hardware architecture is obtained by resorting to a first-order 
(linear) interpolator. Interpolation for this simpler case is depicted in Figure 
3-28, and the output samples are then given by 
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y{ t m,n) = X ( l m T d +n y 

+ P m I* (/* +!)?; +»y ~x[l m T d +»y | • (3.73) 

By re-arranging (3.73) we obtain 

t(C,„ ) = (1 “ lb, ) + » y + fh, * (l m + l ) T d + n y , (3.74) 

which can be cast into a form similar to (3.69) 

y{ t m,n) = y Z C iM x (^O r rf+«v ( 3 - 75 ) 

with 

C o(b) = l-b> (3.76) 



C,(p) = p. (3.77) 

The architecture of the linear interpolator is finally depicted in Figure 3- 
29. 

When implemented in fixed point arithmetic, the linear interpolators will 
be affected by quantization errors. The input signal x{kT d ) is replaced by a 
quantized signal 

x'(kT d ) = (-\) s ^ x . n Y j bt ) -2 k , (3.78) 

k = 0 

where n is the word length, s denotes the sign bit, {b[ x) } , with 
0<k<n c -2 and b[ x) g{0,1}, is the code word for the absolute value of 
x\kT d ) and A v is the quantization step. The parameters n c and A x must be 
chosen so as not to introduce significant distortion in the representation of 
the samples x(kT d ) . In particular, the peak value x max of the signal should 
be such that 



2" c ~' A v = x t 



(3.79) 
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Clock for the update of the interpolator's epoch 
(twice the chip rate) 



Figure 3-29. Architecture of the linear interpolator. 

All of the coefficients C,(p) of the interpolator have absolute values 
smaller than unity for any p in the interval [0,1]. Therefore they can be 
quantized by n c bits (as the input samples) 

C;(n) = (-l)'-A c -"£V ) -2 i (3.80) 

k = 0 



with the same notation as above, and where 

2" C ~ 1 A C =1 . (3.81) 

The samples y(t n ) are generated according to (3.54), and their quantized 
version y'(t n ) is then 



/(O = (-!)' 



•A c -A ; 



• 2 >: 



w, 



(3.82) 



Our results indicate that the quantization step in the representation of 
y'(t n ) can still be assumed to be equal to A x with no significant 
performance impairment. This suggests that some bits can be dropped in the 
expression of y\t n ) in order to obtain a signal representation with the same 
complexity as for x\kT d ) . As is seen in Figure 3-30, the n c - 1 LSBs and 
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the two MSBs are neglected. Word length reduction is carried out so as to 
emulate signal clipping as follows: 

i) if both MSBs are zero, then the remaining n c - 1 bits are left 
unchanged; 

it) if at least one MSB is nonzero, then all of the remaining n c - 1 bits are 
set to 1 . 

Once the n c - 1 LSBs and the two MSBs are dropped, the real and the 
imaginary parts of y(t n ) can be represented as follows 

y'(t n ) = (-iy-A x -£b ( / ) -2*. (3.83) 

k= 0 

Clearly the binary symbols b[ y) in (3.82) and (3.83) are not the same, but 
we preferred to retain the same notation for simplicity. 

As discussed above the outputs of the I and Q interpolators, running at 
the rate 2 R. , are eventually demultiplexed in two low rate ( R c ) signals by 
two demultiplexers, yielding the prompt and E/L sample streams. The 
outputs of the interpolators at twice the chip rate (2 R . ) are also used by the 
CCAU (Section 2.1.1) for coarse code timing recovery. In this code 
acquisition mode the CCTU is inactive, and the sampling epoch of the 
interpolators are arbitrary and constant in time. 



2 bits nc-1 bits nc-1 bits 

/ \/ \/ \ 




2n^1 2nc-2 2n^3 n^1 nc-2 0 



Figure 3-30. Bit reduction in the representation of the samples at the interpolator output. 



2.3 Carrier Synchronization 



2.3.1 Carrier Frequency Synchronization 

Initial code acquisition may be severely impaired when the initial carrier 
frequency offset, denoted as v , owed to residual Doppler shift and/or 
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instability of the local oscillator in the receiver is comparable to the inverse 
of the symbol period T s . Frequency offset causes a sort of ‘decorrelation’ of 
the observed signal within the coherent integration window of the serial 
acquisition scheme discussed above that can be quantified in a power loss 
figure. For instance, it can be shown [Syn98] that a carrier frequency offset 
equal to half the symbol rate yields a coherent integration loss of about 4 dB, 
and far higher losses have to be taken into account for larger frequency 
errors. Unfortunately, estimation of the carrier frequency offset cannot be 
carried out reliably unless code is coarsely acquired. This ‘chicken or egg’ 
problem has no simple solution: the only viable approach is a sort of joint 
two-dimensional time/frequency acquisition over the possible code epochs 
and over a number of pre-determined frequency bins within an assigned 
uncertainty interval. 

Initial frequency uncertainty is especially an issue when dealing with 
Low or Medium Earth Orbiting satellites (LEO/MEO). Even feeder link pre- 
compensation techniques will not prevent the residual downlink Doppler 
shift from being larger than the symbol rate in coded voice communications. 
The difficulty of carrier frequency acquisition is another facet of the 
wideband characteristic of the DS/SS signal. Actually, both in narrowband 
and in SS modulations one has to determine the carrier frequency with an 
accuracy much smaller than the symbol rate to ensure good data detection. 
Clearly this estimation task is apparently harder to accomplish when the 
bandwidth of the observed signal is many times greater than the symbol rate, 
as in wideband modulation. A survey of synchronization techniques for 
CDMA transmissions is presented in [Syn98], 

Upon completion of raw acquisition of initial code phase and carrier 
frequency, the (small) residual offset can be removed at baseband on the 
symbol rate signal resulting form despreading/accumulation. The raw 
frequency offset estimated during the acquisition phase as above is used to 
correct the local oscillator frequency, and the residual frequency error after 
despreading is reduced to a small fraction of the symbol rate. Fine frequency 
offset compensation can be performed resorting to conventional closed loop 
frequency estimation/correction techniques [DAn94]. In particular, a few 
algorithms have been analyzed and experimented with in CDMA modems: 

i) Rotational Frequency Detector (RFD) [Cla93]; 

ii) Dual Filter Discriminator (DFD) [Alb89]; 
in) Cross-Product AFC (CP-AFC) [Nat89]; 
iv) Angle Doubling AFC (AD-AFC). 

Further details on frequency error detectors to be implemented in a digital 
receiver are found in the overview paper [Moe94] and a further example of 
such techniques is also described in [DAn94], 
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Figure 3-31 shows the overall architecture of the AFCU implemented in 
the MUSIC receiver, with an indication about the different processing rates 
in the different circuit parts: f c represents here the sampling rate of the 
ADC, f s is the symbol rate, L is as usual the spreading factor, and N = 4p 
is the number of samples per chip. 



from 

ADC 




Figure 3-31. Block diagram of the AFC unit. 

As is apparent, we use here a ‘long loop’ approach, in which the Multi- 
Rate Front End Unit (MRFEU) is encompassed by the loop. This does not 
harm loop stability, since the relevant processing latency is definitely 
negligible with respect to the intrinsic response time of the AFCU as a 
whole. The output on time samples of the UIU at chip rate are despread with 
the pilot code. The resulting symbol time samples are processed by the 
SACU and are subsequently sent to the Frequency Difference Detector 
(FDD). The latter outputs in turn the frequency error signal e v (k) which is 
filtered by the loop filter according to the following recursive equation, to 
give the updated estimate v(£) (at symbol rate 1 ) of the frequency offset v 

v(£ + l) = v(£)-y<? v (£), (3.84) 

where y is the step size of the frequency tracking algorithm (which in the 
following will be also referred as y AFCU ). The frequency estimate v(/c ) 



1 When using a pilot channel to perform frequency control, we could also lengthen our 
coherent despreading interval with respect to a symbol period, and slow down accordingly 
the updating rate. This would probably make the loop more robust to noise, but makes it 
more sensitive to a large initial frequency offset, which can be in the MUSIC receiver as 
high as 10% of the symbol rate. This is why here we have stuck to symbol time integration 
and symbol rate updating. 
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drives the DCO which in turn provides the complex-valued oscillation 
exp { — jxj/(«)} at the ADC sampling rate. The instantaneous phase v| i(n) of 
such oscillation is generated by the DCO according to the recursive equation 
(running of course at sampling frequency rate) 



\|/(« + l) = \|/(n) + 



2nv(k) 

m 



mod(2jt), 



(3.85) 



where the sampling-rate index n is related to the symbol rate index k 
according to k = int {n / NL) . The counter-rotating complex- valued 
exponential generated by the DCO with in s tantaneous phase (3.85) is used 
by the digital downconverter in Figure 3-7 to remove the frequency offset 
from the I/Q received samples. If we denote with x(k) the symbol time 
signal at the SACU output (see Figure 3-31), the frequency error signal 
e v ( k ) provided by the FDD (see Figure 3-32) is 

e v (k) = 3m{x(£)x* (k - 2)}- . (3.86) 

Assuming ideal chip timing recovery and neglecting for the moment the 
contribution of channel noise, we have 

x[k) = Ae^ 2mkT ‘ + ^ +r| (k) (3.87) 

where v is the frequency offset, T s is the symbol interval and r\(k) is the 
MAI contribution. With (3.87) in (3.86) we find 

e v ( k ) = A 2 sin(4jtvT s ) + p(&) , (3.88) 

where 

p(/t) = T 1 (Ar)e' j[2,tv(t - 2)7;+9] + 1 !* (k - 2) e j(2revAT ' +<p) 

+ r|(A:)r|*(A:-2) . (3.89) 

We want now to obtain an expression for the average characteristics of 
the FDD (the so called S curve). Therefore we compute the statistical 
expectation of e v (k) conditioned on a fixed value of the frequency offset v . 
This computation is easily done if we observe that r\{k) and r\(k - 2) have 
zero mean and are statistically independent. 

The latter property comes from the observation that two MAI samples 




